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(54) Title: GENE EXPRESSION SIGNATURES, METHODS AND COMPOSITIONS FOR DIAGNOSING DISORDERS OF 
THE LUNG 



O (57) Abstract: A minimally invasive sample procurement method for obtaining airway epithelial cell RNA that can be analyzed by 
^ expression profiling, e.g., by array-based gene expression profiling, is disclosed. These methods can be used to identify patterns of 
Q gene expression that are diagnostic of lung disorders, e.g., cancer, to identify subjects at risk for developing lung disorders and to 
^ custom design an array, e.g., a nucroarray, for the diagnosis or prediction of lung disorders or susceptibility to lung disorders. Arrays 
and informative genes arc also disclosed for this purpose. 
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GENE EXPRESSION SIGNATURES, METHODS AND COMPOSITIONS FOR 
DIAGNOSING DISORDERS OF THE LUNG 

RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No. 
5 60/477,218, filed June 10, 2003, U.S. Provisional Application No. 60/483,387, filed 
June 27, 2003, and U.S. Provisional Application No. 60/497,599, filed August 25, 
2003. 

The entire teachings of the above applications are incorporated herein by 
reference. 

10 GOVERNMENT SUPPORT 

The invention was supported, in whole or in part, by grant ES00354 from 
The Katherine and Roland Douglass Fund (American Cancer Society) and by grant 
HL07035 from the National Institute of Health. The Government has certain rights 
in the invention. 

15 BACKGROUND OF THE INVENTION 

Lung cancer claims more than 150,000 lives every year in the United States, 
exceeding the combined mortality from breast, prostate and colorectal cancers. 
Cigarette smoking is the most predominant cause of lung cancer. Presently, 25% of 
the U.S. population smokes, but only 10% to 15% of heavy smokers develop lung 

20 cancer. Former smokers remain at risk for developing cancer and now constitute a 
large reservoir of new lung cancer cases. Approximately 85% of all subjects with 
lung cancer die within three years of diagnosis. Unfortunately survival rates have 
not changed substantially in the past several decades, in large part because there are 
no effective methods for identifying smokers who are at highest risk for developing 
25 lung cancer and no effective tools for early diagnosis. 
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One major hurdle in developing an early detection screen for lung cancer i 
that present methods for diagnosis require removal of tissue from inside the lung. 
Moreover, while it appears that a subset of smokers are more susceptible to the 
carcinogenic effects of cigarette smoke and are more likely to develop lung cancer 
5 particular risk factors, and particularly genetic risk factors, for individuals have go 
largely unidentified. 



SUMMARY OF THE INVENTION 

Work described herein demonstrates the utility of a minimally invasiVe 
sample procurement method and gene expression-based tools for the diagnosis of 

10 diseases of the lung, particularly lung cancer. Work described herein is further 
based on the identification of unique sets of expressed genes associated with 
smokers and non-smokers that constitute expression signatures. Epithelial cell gene 
expression profiles obtained from relatively accessible sites can provide important 
diagnostic and therapeutic information which can be applied to diagnose and treat 

1 5 lung disorders. For example, the gene expression profiles or signatures disclosed 
herein can be used to distinguish between, e.g., non-smokers and smokers, smokers 
and smokers with cancer, non-smokers and smokers with cancer, or a combination 
thereof. 

In particular, work described herein provides a minimally invasive sample 
20 procurement method for obtaining airway epithelial cell RNA that can be analyzed 
by expression profiling, e.g., by array-based gene expression profiling. These 
methods can be used to determine how airway epithelial cell gene expression 
profiles are affected by cigarette smoke and how these profiles differ in smokers 
with and without lung cancer. These methods can also be used to identify patterns 
25 of gene expression that are diagnostic of lung disorders, e.g., cancer, and to identify 
subjects at risk for developing lung disorders. All or a subset of the genes identified 
according to the methods described herein (e.g., the genes disclosed in FIGS. IA- 
IN, 2A-2F, 3A-3D, 4A-4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E) can be used to design 
an array, e.g., a microarray, specifically intended for the diagnosis or prediction of 
30 lung disorders or susceptibility to lung disorders, optionally in combination with 
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probes intended for the diagnosis or prediction of other disorders (e.g., in 
combination with probes intended for the diagnosis or prediction of other types of 
cancer), based on the gene expression signatures exhibited by smokers and non- 
smokers as shown in FIGS. 6A-6R. For example, an array can be designed which 
5 has probes for one or more of the genes disclosed herein immobilized thereon/ The 
efficacy of such custom-designed arrays can be further tested, for example, in a large 
clinical trial of smokers. 

In one embodiment, the invention relates to a method of distinguishing a 
biological sample from a non-smoker and a biological sample from a smoker, and/or 

10 a biological sample from a smoker and a biological sample from a smoker with 
cancer, and/or a biological sample from a non-smoker and a biological sample from 
a smoker with cancer, wherein the method comprises obtaining a biological sample 
from an individual to be diagnosed; determining the level of expression of one or 
more informative genes in said sample, and comparing the determined level of 

15 expression with the corresponding level of expression in an appropriate control (e.g., 
one or more of a non-smoker, a smoker and a smoker with cancer). In a preferred 
embodiment, the biological sample is obtained from an airway epithelial cell. In one 
embodiment, the airway epithelial cell is obtained from a bronchoscopy or buccal 
mucosal scraping. 

20 In a preferred embodiment the informative genes are selected from the group 

consisting of the genes in FIGS. 1A-1N, 2A-2F, 3A-3D, 4A-4P, 5A-5F, 6A-6R, 7A- 
7L and 8A-8E. Preferably the expressions of two or more, five or more, ten or more, 
fifteen or more, twenty or more, fifty or more, one hundred or more, or one thousand 
or more informative genes are determined. 

25 In a particular embodiment, the biological sample is a nucleic acid sample, 

and in a preferred embodiment the level of expression of one or more informative 
genes is determined using oligonucleotides which hybridize to the one or more 
informative genes. In one embodiment the oligonucleotides are immobilized on a 
substrate. In a particular embodiment, the nucleic acid sample is RNA. 
30 In another particular embodiment, the biological sample comprises a gene 

expression product, and in a preferred embodiment the level of expression of one or 
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more infonnative genes is detennined using polymers (e.g., polypeptides) which 
bind to gene expression products encoded by the one or more infonnative genes. In 
a particular embodiment the polymers are immobilized on a substrate. 

In one embodiment, the invention relates to a method of diagnosing a disease 
5 of the lung, in particular lung cancer, comprising obtaining a biological sample from 
an individual to be diagnosed; and determining the expression of one or more 
informative genes in said sample, wherein increased expression of an informative 
gene whose expression is increased in individuals having a disease of the lung, or 
decreased expression of an informative gene whose expression is decreased in 
10 individuals having a disease of the lung, is indicative of a disease of the lung in the 
individual. In one embodiment, individual to be diagnosed is an individual who has 
been exposed to tobacco smoke, an individual who has smoked, or an individual 
who smokes. In a preferred embodiment, the biological sample is obtained from an 
airway epithelial cell. In one embodiment, the airway epithelial cell is obtained from 
1 5 a bronchoscopy or buccal mucosal scraping. 

In a preferred embodiment of the method, the informative genes are selected 
from the group consisting of the genes shown in FIGS. 1 A-1N, 2A-2F, 3A-3D, 4A- 
4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E. Preferably the expressions of two or more, 
five or more, ten or more, fifteen or more, twenty or more, fifty or more, one 
20 hundred or more, or one thousand or more informative genes are detennined. 

In one embodiment, the disease of the lung is selected from the group 
consisting of asthma, chronic bronchitis, emphysema, primary pulmonary 
hypertension, acute respiratory distress syndrome, hypersensitivity pneumonitis, 
eosinophilic pneumonia, persistent fungal infection, pulmonary fibrosis, systemic 
25 sclerosis, ideopathic pulmonary hemosiderosis, pulmonary alveolar proteinosis, and 
lung cancer, such as adenocarcinoma, squamous cell carcinoma, small cell 
carcinoma, large cell carcinoma, and benign neoplasms *of the lung (e.g., bronchial 
adenomas and hamartomas). 

In a particular embodiment, the biological sample is a nucleic acid sample, 
30 and in a prefened embodiment the level of expression of one or more informative 
genes is detennined using oligonucleotides which hybridize to the one or more 
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informative genes. In one embodiment the oligonucleotides axe immobilized on a 
substrate. In a particular embodiment, the biological sample is RNA. Inapreferred 
embodiment, the expression is determined using a microarry having one or more 
probes (e.g., oligonucleotides) for said one or more genes immobilized thereon. 
5 In another particular embodiment, the biological sample comprises a gene 

expression product, and in a preferred embodiment the level of expression of one or 
more mformative genes is determined using polymers (e.g., polypeptides) which 
bmd to gene expression products encoded by the one or more informative genes In 
a parhcular embodiment the polymers are immobilized on a substrate 
10 ™ emventio *^rel^ 

use in expression analysis comprising obtaining an airway epithelial cell sample 
from an individual. The method may further comprise rendering nucleic acid 
molecules in said cell sample available for hybridization, and/or rendering gene 
expression products in said cell sample available for binding to polymers. 
15 The invention also relates to a method of treating a disease of the lung in 

parhcular lung cancer, comprising administering to an individual in need thereof an 
effective amount ofanagent which increases the expression of an informative gene 
whose expression is decreased in said individual as compared with a nonnal 
individual. 

The invention further relates to a method of treating a disease of the lung in 
pamcular lung cancer, comprising administering to an individual in need thereof an 
effectve amount of an agent which decreases the expression of an informative gene 
whose expression is increased in said individual as compared with a normal 
individual. 

The invention also relates to a method of treating a disease of the lung in 
parhcular lung cancer, comprising administering to an individual in need thereof an 
effective amount of an agent wmch increases the activity of an expression product of 
an informative gene whose activity is decreased in said individual as compared with 
a nonnal individual. 

The invention also relates to a method of treating a disease of the lung in 
particular lung cancer, comprising administering to an individual in need thereof an 
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effective amount of an agent which decreases the activity of an expression product 
of an informative gene whose activity is increased in said individual as compared 
with a normal individual. 

The invention also provides a substrate for use in expression analyses having 
5 immobilized thereon a plurality of polymers (e.g., oligonucleotides, polypeptides, 
etc.) which bind specifically to one or more informative genes disclosed herein or 
expression products of said genes. In a particular embodiment, the substrate is an 
array (e.g., a microarray) having immobilized thereon a plurality of oligonucleotides. 
In one embodiment, less than 1 000 oligonucleotides or polymers are immobilized on 

1 0 the array. In another embodiment, the oligonucleotides hybridize specifically to one 
allelic form of one or more informative genes disclosed herein. In a particular 
embodiment, the informative genes are selected from the group consisting 
essentially of the genes shown in FIGS. 1 A-1N, 2A-2F, 3A-3D, 4A-4P, 5A-5F, 6A- 
6R, 7A-7L and 8A-8E. In one embodiment the substrate has imm obilized thereon 

1 5 only polymers which bind specifically to one or more of the informative genes 
disclosed herein or expression products of said genes. In other embodiments the 
substrate has additional polymers immobilized thereon, such as polymers which bind 
specifically to informative genes for disorders such as types of cancer. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 The foregoing and other objects, features and advantages of the invention 

will be apparent from the following more particular description of preferred 
embodiments of the invention, as illustrated in the accompanying drawings. 

FIGS. 1A-1N is a table of genes which are differentially expressed in 
smokers and smokers with cancer. Column one is the probe set designation for the 

25 gene; Column two is the common name(s) for the gene; Column three is the 
GenBank accession number for the gene sequence; Column four is the map 
(chromosomal) position of the gene; and Column five is descriptive information for 
the gene. The genes which overlap between FIGS. 1 A-1N and FIGS. 2A-2F are 
indicated with an " A " (21 genes). The genes which overlap between FIGS. 1A-1N 

30 and FIGS. 3A-3D are indicated with an (5 genes). 
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FIGS. 2A-2F is a table of genes which are differentially expressed in 
smokers with cancer and non- smoker, Column one is the probe set designation for 
the gene; Column two is the common name(s) for the gene; Column three is the 
GenBank accession number for the gene sequence; Column four is the map 
5 (chromosomal) position of the gene; and Column five is descriptive information for 
the gene. The genes which overlap between FIGS. 1 A-1N and FIGS 2A-2F are 
indicated with an (21 genes). The genes which overlap between FIGS. 2A-2F 
and FIGS. 3A-3D are indicated with an "#" (23 genes). 

FIGS. 3A-3D is a table of genes which are differentially expressed in 
10 smokers and non- smokers. Column one is the probe set designation for the gene- 
Column two is the common name(s) for the gene; Column three is the GenBank ' 
accession number for the gene sequence; Column four is the map (chromosomal) 
position of the gene; and Column five is descriptive information for the gene The 
geneswhichoverlapbetweenFIGS.3A-3DandFIGS. 1A-1N are indicated with an ' 
15 "*"(5genes). The genes which overlap between FIGS. 3A-3D and FIGS. 2A-2F are 
indicated with an "#" (23 genes). 

FIGS. 4A-4P is a table of 131 genes which are differentially expressed and 
can be used to distinguish between non-smokers, smokers and smokers with cancer 
Column one is the fold change in expression between non-smokers and smokers- 

20 C^twoshowshowmeexpressiondiffersinsmokersascomparedtonon- ' 
smokers; Column three is the fold change in expression between non-smokers and 
smokers with cancer; Column four shows how the expression differ in smokers 
with cancer as compared to non-smokers; Column five is the fold change in 
expression between smokers and smokers with cancer; Column six shows how the 
25 expression differs in smokers with cancer as compared to smokers; Column seven is 
the common name(s) for the gene; Column eight is the GenBank accession number 
for the gene sequence; Column nine is the map position of the gene; Column ten is 
the chromosomal position of the gene; Column 1 1 is the description of the gene. 

FIGS. 5A-5F is a table of data similar to the data shown in FIGS 4A-4P for 
30 131 genes -^h are differentially expressed and can be used to di^^ 
non-smokers, smokers and smokers with cancer. Column one is the probe set 
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designation for the gene; Column two is the common name(s) for the gene; Column 
three is the GenBank accession number for the gene sequence; Column four is the 
map (chromosomal) position of the gene; and Column five is descriptive information 
for the gene. 

5 FIGS. 6A-6R is a table of genes which are differentially expressed in 

smokers and non-smokers. Column one is the probe set designation for the gene; 
Column two is the common name(s) for the gene; Column three is the GenBank 
accession number for the gene sequence; Column four is the p value for the 
smoker/non-smoker comparison; Column five shows whether the gene is up or down 
10 regulated; Column six shows the fold change in expression; Column seven is the 
GenBank description of the gene; Column eight is the functional category of the 
gene as indicated in GenBank; Column nine is the map (chromosomal) position of 
the gene; Column ten is the link locus; Column eleven is the Unigene reference for 
the gene; and Column twelve is OMM reference for the gene. On FIG. 6R, five 
1 5 probe sets are shown which resulted in redundant results. 

FIGS. 7A-7L is a table of genes which are differentially expressed in samples 
from smokers (S)> nonsmokers (NS) and smokers with cancer (C). Column one is 
the probe set designation for the gene; Column two is the gene title; Column three is 
the gene symbol; Column four is the chromosomal location of the gene; Column five 
20 is the locus link. 

FIGS. 8A-8E is a table of genes which are differentially expressed in samples 
from smokers (S) and smokers with cancer (C). Column one is the probe set 
designation for the gene; Column two is the gene title; Column three is the gene 
symbol; Column four is the chromosomal location of the gene; Column five is the 
25 locus link. 

DETAILED DESCRIPTION OF THE INVENTION 

A description of preferred embodiments of the invention follows. 
Lung cancer in smokers involves histopathological and molecular 

progression from normal to premalignant to cancer. Gene expression arrays of lung 
30 tumors have been used to characterize expression profiles of lung cancers, and to 
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show the progression of molecular changes from non-malignant lung tissue to lung 
cancer. The ability to determine which individuals, particularly among smokers, 
have molecular changes in their airway epithelial cells and how these changes relate 
to premalignant and malignant changes that have been described in the lung has 
5 great potential for determining risk and for diagnosing cancer at a stage when 
treatment can be more effective, Ihus reducing the mortality and morbidity rates of 
lung cancer. The relative ease with which airway epithelial cells can be obtained 
(from bronchoscopy and buccal mucosal scrapings) suggests that this approach will 
have wide clinical applicability and has the potential for becoming a standard 
10 clinical screening tool for the large number of subjects at risk for developing 
disorders of the lung. 

As used in this application, the singular form "a," "an," and "the" include 
plural references unless the context clearly dictates otherwise. For example, the 
term "an agent" includes a plurality of agents, including mixtures thereof. 
1 5 An individual is not limited to a human being but may also be other 

organisms including but not limited to mammals, plants, bacteria, or cells derived 
from any of the above. 

Throughout this disclosure, various aspects of this invention can be presented 
in a range format. It should be understood that the description in range format is 

20 merely for convenience and brevity and should not be construed as an inflexible 
limitation on the scope of the invention. Accordingly, the description of a range 
should be considered to have specifically disclosed all the possible subranges as well 
as individual numerical values wimin that range. For example, description of a 
range such as from 1 to 6 should be considered to have specifically disclosed 

25 subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, 
from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 
3, 4, 5, and 6. This applies regardless of the breadth of the range. 

The practice of the present invention may employ, unless otherwise 
indicated, conventional techniques and descriptions of organic chemistry, polymer 

30 technology, molecular biology (including recombinant techniques), cell biology, 
biochemistry, and immunology, which are within the skill of the art. Such 
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conventional techniques include polymer array synthesis, hybridization, ligation, and 
detection of hybridization using a label. Specific illustrations of suitable techniques 
can be had by reference to the example herein below. However, other equivalent 
conventional procedures can, of course, also be used. Such conventional techniques 
5 and descriptions can be found in standard laboratory manuals such as Genome 

Analysis: A Laboratory Manual Series (Vols, I-IV), Using Antibodies: A Laboratory 
Manual, Cells: A Laboratory Manual, PCR Primer: A Laboratory Manual, and 
Molecular Cloning: A Laboratory Manual (all from Cold Spring Harbor Laboratory 
Press), Stryer, L. (1 995) Biochemistry (4th Ed.) Freeman, New York, Gait, 
1 0 "Oligonucleotide Synthesis: A Practical Approach " 1984, IRL Press, London, 
Nelson and Cox (2000), Lehninger, Principles of Biochemistry 3 rd Ed., W.H. 
Freeman Pub., New York, NY and Berg et al (2002) Biochemistry, 5 th Ed., W.H. 
Freeman Pub., New York, NY, all of which are herein incorporated in their entirety 
by reference for all purposes. 
1 5 The present invention can employ solid substrates, including arrays in some 

preferred embodiments. Methods and techniques applicable to polymer (including 
protein) array synthesis have been described in U.S.S.N 09/536,841, WO 00/58516, 
U.S. Patents Nos. 5,143,854, 5,242,974, 5,252,743, 5,324,633, 5,384,261, 
5,405,783, 5,424,186, 5,451,683, 5,482,867, 5,491,074, 5,527,681, 5,550,215, 
20 5,571,639, 5,578,832, 5,593,839, 5,599,695, 5,624,71 1, 5,631,734, 5,795,716, 
5,831,070, 5,837,832, 5,856,101, 5,858,659, 5,936,324, 5,968,740, 5,974,164, 
5,981,185, 5,981,956, 6,025,601, 6,033,860, 6,040,193, 6,090,555, 6,136,269, 
6,269,846 and 6,428,752, in PCT Applications Nos. PCT7US99/00730 
(International Publication Number WO 99/36760) and PCT/US 01/0428 5 
25 (International Publication Number WO 01/58593), which are all incorporated herein 
by reference in their entirety for all purposes. 

The term "array" as used herein refers to an intentionally created collection of 
molecules which can be prepared either synthetically or biosynthetically. The 
molecules in the array can be identical or different from each other. The array can 
30 assume a variety of formats, for example, libraries of soluble molecules; libraries of 
compounds tethered to resin beads, silica chips, or other solid supports. 
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Additionally, the term "array" is meant to include those libraries of nucleic acids 
which can be prepared by spotting nucleic acids of essentially any length (for 
example, from 1 to about 1000 nucleotide monomers in length) onto a substrate. 

The term "solid support", "support", and "substrate" as used herein are used 
5 interchangeably and refer to a material or group of materials having a rigid or 
semi-rigid surface or surfaces. In many embodiments, at least one surface of the 
solid support will be substantially flat, although in some embodiments it may be 
desirable to physically separate synthesis regions for different compounds with, for 
example, wells, raised regions, pins, etched trenches, or the like. According to other 
1 0 embodiments, the solid supports) will take the form of beads, resins, gels, 

microspheres, or other geometric configurations. See U.S. Patent No. 5,744,305 for 
exemplary substrates. 

Patents that describe synthesis techniques in specific embodiments include 
U.S. Patents Nos. 5,412,087, 6,147,205, 6,262,216, 6,310,189, 5,889,165, and 
15 5,959,098. Nucleic acid arrays are described in many of the above patents, but the 
same techniques are applied to polypeptide arrays. 

Nucleic acid arrays that are useful in the present invention include those that 
are commercially available from Affymetrix (Santa Clara, CA) under the brand 
name GeneChip®. 

20 The present invention also contemplates many uses for polymers attached to 

solid substrates. These uses include gene expression monitoring, profiling, library 
screening, genotyping and diagnostics. Gene expression monitoring, and profiling 
methods can be shown in U.S. Patents Nos. 5,800,992, 6,013,449, 6,020,135, 
6,033,860, 6,040,138, 6,177,248 and 6,309,822. Genotyping and uses therefore are 

25 shown in U.S.S.N. 60/319,253, 10/013,598 (U.S. Patent Application Publication 
No.: 20030036069), and U.S. Patents Nos. 5,856,092, 6,300,063, 5,858,659, 
6,284,460, 6,361,947, 6,368,799 and 6,333,179. Other uses are embodied in U.S. 
Patents Nos. 5,871,928, 5,902,723, 6,045,996, 5,541,061, and 6,197,506. 

The present invention also contemplates sample preparation methods in 
v 30 certain preferred embodiments. Prior to or concurrent with genotyping, the genomic 
sample may be amplified by a variety of mechanisms, some of which may employ 
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PCR. See, e.g., PCR Technology: Principles and Applications for DNA 
Amplification (Ed. H.A. Erlich, Freeman Press, NY, NY, 1992); PCR Protocols: A 
Guide to Methods and Applications (Eds. Innis, et al 9 Academic Press, San Diego, 
CA, 1990); Mattila et al 9 Nucleic Acids Res, 19, 4967 (1991); Eckert et al 9 PCR 
5 Methods and Applications 1, 17 (1991); PCR (Eds. McPherson et al, IRL Press, 
Oxford); and U.S. Patent Nos. 4,683,202, 4,683,195, 4,800,159 4,965,188,and 
5,333,675, and each of which is incorporated herein by reference in their entireties 
for all purposes. The sample may be amplified on the array. See, for example, U.S 
Patent No. 6,300,070 and U.S. Patent Application No. 09/513,300, which are 

1 0 incorporated herein by reference. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(e.g., Wu and Wallace, Genomics 4, 560 (1989), Landegren et al 9 Science 241, 1077 
(1988) and Barringer et al Gene 89:1 17 (1990)), transcription amplification (Kwoh 
et al, Proc. Natl Acad Sci. USA 86, 1 173 (1989) and WO88/10315), self-sustained 

15 sequence replication (Guatelli etal 9 Proc. Nat. Acad. Sci USA, 87, 1874 (1990) and 
WO90/06995), selective amplification of target polynucleotide sequences (U.S. 
Patent No. 6,410,276), consensus sequence primed polymerase chain reaction (CPt 
PCR) (U.S. Patent No 4,437,975), arbitrarily primed polymerase chain reaction (AP- 
PCR) (U.S. Patent No 5,413,909, 5,861,245) and nucleic acid based sequence 

20 amplification (NABSA). (See 9 U.S. Patents Nos. 5,409,818, 5,554,517, and 

6,063,603, each of which is incorporated herein by reference). Other amplification 
methods that may be used are described in, U.S. Patent Nos. 5,242,794, 5,494,810, 
4,988,617 and in U.S.S.N. 09/854,317, each of which is incorporated herein by 
reference. 

25 Additional methods of sample preparation and techniques for reducing the 

complexity of a nucleic sample are described in Dong et al 9 Genome Research 11, 
1418 (2001), in U.S. Patent Nos. 6,361,947, 6,391,592 and U.S. Patent Application 
Nos. 09/916,135, 09/920,491 (U.S. Patent Application Publication No. 
20030096235), 09/910,292 (U.S. Patent Application Publication No. 20030082543, 

30 U.S. Patent No. 6,632,61 1), and 10/013,598 (U.S. Patent Application Publication 
No. 20030036069). 
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Methods for conducting polynucleotide hybridization assays have been well 
developed in the art. Hybridization assay procedures and conditions will vary 
depending on the application and are selected in accordance with the general binding 
methods known including those referred to in: Maniatis et al Molecular Cloning: A 
5 Laboratory Manual (2 nd Ed. Cold Spring Harbor, N. Y, 1 989); Berger and Kimmel 
Methods in Enzymology, Vol. 152, Guide to Molecular Cloning Techniques 
(Academic Press, Inc., San Diego, CA, 1987); Young and Davism, P.N.A.S, 80: 
1 194 (1983). Methods and apparatus for carrying out repeated and controlled 
hybridization reactions have been described in U.S. Patent Nos. 5,871,928, 

10 5,874,219, 6,045,996 and 6,386,749, 6,391,623, each of which are incorporated 
herein by reference. 

The present invention also contemplates signal detection of hybridization 
between ligands in certain preferred embodiments. See U.S. Patent Nos. 5,143,854, 
5,578,832; 5,631,734; 5,834,758; 5,936,324; 5,981,956; 6,025,601; 6,141,096; 

15 6,185,030; 6,201,639; 6,218,803; and 6,225,625, in U.S. Patent Application No. 
60/364,731 and in PCT Application PCT/US99/06097 (published as W099/47964), 
each of which also is hereby incorporated by reference in its entirety for all purposes. 

Methods and apparatus for signal detection and processing of intensity data 
are disclosed in, for example, U.S. Patents Nos. 5,143,854, 5,547,839, 5,578,832, 

20 5,631,734, 5,800,992, 5,834,758; 5,856,092, 5,902,723, 5,936,324, 5,981,956, 
6,025,601, 6,090,555, 6,141,096, 6,185,030, 6,201,639; 6,218,803; and 6,225,625, 
in U.S. Patent Application No. 60/364,731 and in PCT Application 
PCT/US99/06097 (published as W099/47964), each of which also is hereby 
incorporated by reference in its entirety for all purposes. 

25 The practice of the present invention may also employ conventional biology 

methods, software and systems. Computer software products of the invention 
typically include computer readable medium having computer-executable 
instructions for performing the logic steps of the method of the invention. Suitable 
computer readable medium include floppy disk, CD-ROM/DVD/DVD-ROM, hard- 

30 disk drive, flash memory, ROM/RAM, magnetic tapes and etc. The computer 
executable instructions may be written in a suitable computer language or 
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combination of several languages. Basic computational biology methods are 
described in, e.g. Setubal and Meidanis et al, Introduction to Computational 
Biology Methods (PWS Publishing Company, Boston, 1997); Salzberg, Searles, 
Kasif, (Ed.), Computational Methods in Molecular Biology, (Elsevier, Amsterdam, 
5 1 998); Rashidi and Buehler, Bioinformatics Basics: Application in Biological 
Science and Medicine (CRC Press, London, 2000) and Ouelette and Bzevanis 
Bioinformatics: A Practical Guide for Analysis of Gene and Proteins (Wiley & 
Sons, Inc., 2 nd ed., 2001). 

The present invention may also make use of various computer program 
1 0 products and software for a variety of purposes, such as probe design, management 
of data, analysis, and instrument operation. See, U.S. Patent Nos. 5,593,839, 
5,795,716, 5,733,729, 5,974,164, 6,066,454, 6,090,555, 6,185,561, 6,188,783, 
6,223,127, 6,229,91 1 and 6,308,170. 

Additionally, the present invention may have preferred embodiments that 
1 5 include methods for providing genetic information over networks such as the 
Internet as shown in U.S. Patent Application Nos. 10/063,559 (U.S. Patent 
. Application Publication No. 20020 1 83936), 60/349,546, 60/376,003, 60/394,574, 
60/403,381. 

Further methods for practicing the present invention are also exemplified in 
20 Spiraera/., Chest 125(5 Suppl):115S (2004) ; Spira etal, Biotechniques 

5d(3):484-7 (2004) ; and Powell etal, Am. J. Respir. Cell. Mol. Biol. 2P(2):157-62 
(2003), the teachings of all of which are incorporated herein in their entirety. 

As used herein, an informative gene is a gene which is differentially 
expressed in sample populations with different phenotypes (e.g., presence of or 
25 susceptibility to disorders of the lung, particularly lung cancer) or behavioral traits 
(e.g., smoking), e.g., differentially expressed in smokers as compared with non- 
smokers, in smokers with cancer as compared to smokers without cancer, in smokers 
likely to develop cancer as compared with smokers unlikely to develop cancer, etc. 
In one embodiment, an informative gene displays at least a fold change shown in one 
30 of the rows of Column six of FIGS. 6A-6R. In a preferred embodiment, an 

informative gene displays at least a two-fold, more preferably at least a five-fold, and 
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even more preferably at least a ten-fold difference in expression between two sample 
populations. Examples of informative genes are shown in FIGS. 1A-1N, 2A-2F, 
3A-3D, 4A-4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E; other informative genes can be 
identified using methods described herein. 
5 As used herein, gene expression products are proteins, peptides, or nucleic 

acid molecules (e.g., mRNA, tRNA, rRNA, or cRNA) that are involved in 
transcription or translation. The nucleic acid molecule levels measured can be 
derived directly from the gene or, alternatively, from a corresponding regulatory 
gene. All forms of gene expression products can be measured, including, for 
10 example, spliced variants. Similarly, gene expression can be measured by assessing 
the level of protein or derivative thereof translated from mRNA. In this embodiment 
the determination of the gene expression profile can be made using techniques for 
protein detection and quantitation known in the art. For example, antibodies specific 
for the protein or polypeptide can be obtained using methods which are routine in 
15 the art, and the specific binding of such antibodies to protein or polypeptide gene 
expression products can be detected and measured. Alternatively, polymers 
immobilized on a substrate (e.g., 1 protein arrays) can be used. 

In the embodiment wherein arrays of nucleic acids are immobilized on a 
surface, the number of nucleic acid sequences may be selected for different 
20 applications, and may be, for example, about 100 or more, or, e.g., in some 

embodiments, more than 10 s or 10 8 . In one embodiment, the surface comprises at 
least 1 00 probe nucleic acids each preferably having a different sequence, each probe 
contained in an area ofless than about 0.1 cm 2 , or, for example, between about 1 
urn 2 and 10,000 urn 2 , and each probe nucleic acid having a defined sequence and 
25 location on the surface. In one embodiment, at least 1,000 different nucleic acids are 
provided on the surface, wherein each nucleic acid is contained within an area less 
than about 10 3 cm 2 , as described, for example, in U.S. Patent No. 5,510,270, the 
disclosure of which is incorporated herein. 

Arrays of nucleic acids for use in gene expression monitoring are described 
30 in PCT WO 97/10365, the disclosure of which is incorporated herein. In one 

embodiment, arrays of nucleic acid probes are immobilized on a surface, wherein the 
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array comprises more than 100 different nucleic acids and wherein each different 
nucleic acid is localized in a predetermined area of the surface, and the density of the 
different nucleic acids is greater than about 60 different nucleic acids per 1 cm 2 . 

Arrays of nucleic acids immobilized on a surface which may be used also are 
5 described in detail in U.S. Patent No. 5,744,305, the disclosure of which is 
incorporated herein. As disclosed therein, on a substrate, nucleic acids with 
different sequences are immobilized each in a predefined area on a surface. For 
example, 10, 50, 60, 100, 10 3 , 10 4 , 10 5 , 10 6 , 10 7 , or 10 8 different monomer sequences 
may be provided on the substrate. The nucleic acids of a particular sequence are 
10 provided within a predefined region of a substrate, having a surface area, for 

example, of about 1 cm 2 to 10" 10 cm 2 . In some embodiments, the regions have areas 
of less than about 10 1 , 10' 2 , 10" 3 , 10" 4 , 10 5, 10*, 10" 7 , 10" 8 , 10' 9 , or 10' 10 cm 2 . For 
example, in one embodiment, there is provided a planar, non-porous support having 
at least a first surface, and a plurality of different nucleic acids attached to the first 
15 surface at a density exceeding about 400 different nucleic acids/cm 2 , wherein each of 
the different nucleic acids is attached to the surface of the solid support in a different 
predefined region, has a different determinable sequence, and is, for example, at 
least 4 nucleotides in length. The nucleic acids may be, for example, about 4 to 20 
nucleotides in length. The number of different nucleic acids may be, for example, 
20 1 000 or more. In the embodiment where polynucleotides of a known chemical 
sequence are synthesized at known locations on a substrate, and binding of a 
complementary nucleotide is detected, and wherein a fluorescent label is detected, 
detection may be implemented by directing light to relatively small and precisely 
known locations on the substrate. For example, the substrate is placed in a 
25 microscope detection apparatus for identification of locations where binding takes 
place. The microscope detection apparatus includes a monochromatic or 
polychromatic light source for directing light at the substrate, means for detecting 
fluoresced light from the substrate, and'means for determining a location of the 
fluoresced light. The means for detecting light fluoresced on the substrate may in 
30 some embodiments include a photon counter. The means for determining a location 
of the fluoresced light may include an x/y translation table for the substrate. 
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Translation of the substrate and data collection are recorded and managed by an 
appropriately programmed digital computer, as described in U.S. Patent No. 
5,510,270, the disclosure of which is incorporated herein. 

The sample to be assessed can be any sample that contains a gene expression 
5 product. Suitable sources of gene expression products, i.e., samples, can include 
cells, lysed cells, cellular material for determining gene expression, or material 
containing gene expression products. Examples of such samples are blood, plasma, 
lymph, urine, tissue, mucus, sputum, saliva or other cell samples. Methods of 
obtaining such samples are known in the art. In a preferred embodiment, the sample 
10 is an airway epithelial cell. 

Disorders which may be diagnosed or treated by methods described herein 
include, but are not limited to, asthma, chronic bronchitis, emphysema, 
bronchietasis, primary pulmonary hypertension and acute respiratory distress 
syndrome. The methods described herein may also be used to diagnose or treat lung 
15* disorders that involve the immune system including, hypersensitivity pneumonitis, 
eosinophilic pneumonias and persistent fungal infections. Other disorders include 
pulmonary fibrosis, systemic sclerosis, ideopathic pulmonary hemosiderosis, 
pulmonary alveolar proteinosis, cancers of the lung such as adenocarcinoma, 
squamous cell carcinoma, small cell and large cell carcinomas, and benign 
20 neoplasms of the lung including bronchial adenomas and hamartomas. 

The present invention also provides methods for monitoring the effect of a 
treatment regimen in an individual by monitoring the expression for one or more 
informative genes. For example, a baseline expression level for one or more 
informative genes from the individual can be determined, and repeated gene 
25 expression levels can be determined at time points during treatment. A shift in gene 
expression from a profile correlated with the disorder or with poor treatment 
outcome to a profile correlated with lack of the disorder or improved treatment 
outcome is evidence of an effective therapeutic regimen, while a repeated profile 
correlated with the disorder or with poor treatment outcome is evidence of an 
30 ineffective therapeutic regimen. 
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It is also clear that the present invention can be used to generate databases 
comprising informative genes which will have many applications in medicine, 
research and industry and which can be used as a point of comparison in many of the 
analyses described herein. 
5 The present invention has many preferred embodiments and relies on many 

patents, applications and other references for details known to those of the art. 
Therefore, when a patent, application, or other reference is cited herein, it should be 
understood that it is incorporated by reference in its entirety for all purposes as well 
as for the proposition that is recited. 

10 EXAMPLES 
Example 1 
Methods 

Samples of epithelial cells, obtained by brushing airway surfaces, were 
obtained from intra- and extra-pulmonary airways in 1 1 normal non-smokers (NS), 

15 15 smokers without lung cancer (S), and 9 smokers with lung cancer (SC). 5-10 fig 
ofRNA was extracted using standard trizol-based methods, quality of RNA was 
assayed in gels, and the RNA was processed using standard protocols developed by 
Afifymetrix for the U133 human array. Expression profiles, predictive algorithms, 
and identification of critical genes are made using bioinformatic methods. In 

20 particular, "Significance analysis of microarrays" (SAM) software (Tusher et al 9 
PNAS, 98:51 16-21, 2001) and "Prediction analysis for microarrays" (PAM) software 
(Tibshirani et al 9 PNAS, PP:6567-72, 2002) (both Stanford University Proteomics 
Center) were used. 

For the analysis shown in FIGS. 4A-4P, samples were obtained as described 
25 above from 6 smokers with lung cancer (SC), 9 smokers without lung cancer (S) and 
5 normal non-smokers (NS). 
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Results 

FIGS. 1A-1N show genes which are mfferentially expressed in smokers and 
smokers with cancer; FIGS. 2A-2F show genes which are differentially expressed in 
smokers with cancer and non- smokers; and FIGS. 3A-3D show genes which are 
5 differentially expressed in smokers and non- smokers. 

FIGS. 4A-4P and 5A-5F show 131 genes which are differentially expressed 
and can be used to distinguish between non-smokers, smokers and smokers with 
cancer. 

FIGS. 6A-6R show genes which are differentially expressed in smokers and 
10 non-smokers. 

FIGS. 7A-7L is a table of genes which are differentially expressed in samples 
from smokers (S), nonsmokers (NS) and smokers with cancer (C). 

FIGS. 8A-8E is a table of genes which are differentially expressed in samples 
from smokers (S) and smokers with cancer (C). 

15 

The contents of preferences, patents and published patent applications cited 
throughout this application are hereby incorporated by reference. 

While this invention has been particularly shown and described with 
references to preferred embodiments thereof, it will be understood by those skilled in 
20 the art that various changes in form and details may be made therein without 
departing from the scope of the invention encompassed by the appended claims. 
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CLAIMS 

What is claimed is: 

1 . A method of diagnosing a lung cancer comprising: 

a) obtaining a nucleic acid sample from an individual to be diagnosed; 
5 and 

b) determining the expression of one or more informative genes in said 
sample, wherein the informative genes are selected from the group 
consisting of the genes shown in Figures 1 A-1N, 2A-2F, 3A-3D, 4A- 
4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E, 

0 wherein increased expression of an informative gene whose expression is 

increased in individuals having a lung cancer, or decreased expression of an 
informative gene whose expression is decreased in individuals having a lung 
cancer, is indicative of a lung cancer in the individual. 

2. A method according to Claim 1, wherein the lung cancer selected from the 
5 group consisting of adenocarcinoma, squamous cell carcinoma, small cell 

carcinoma, large cell carcinoma, and benign neoplasms of the lung. 

3. A method according to Claim 2, wherein the lung cancer is a benign 
neoplasm selected from the group consisting of bronchial adenomas and 
hamartomas. 

) 4. A method according to Claim 1 , wherein the nucleic acid sample is RNA. 

5. A method according to Claim 1 , wherein the individual to be diagnosed is an 
individual who has been exposed to tobacco smoke. 

6. A method according to Claim 1 , wherein the individual to be diagnosed is an 
individual who has smoked. 
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7. A method according to Claim 1 , wherein the individual to be diagnosed is an 
individual who smokes. 

8. A method according to Claim 1 , wherein the expression of two or more 
informative genes is determined. 

5 9. A method according to Claim 1 , wherein the expression of five or more 
informative genes is determined. 

10. A method according to Claim 1 , wherein the expression of ten or more 
informative genes is determined. 

11. A method according to Claim 1, wherein the expression of fifteen or more 
10 informative genes is determined. 

12. A method according to Claim 1 , wherein the expression of twenty or more 
informative genes is determined. 

13. A method according to Claim 1 , wherein the expression of fifty or more 
informative genes is determined. 

15 14. A method according to Claim 1 , wherein the expression of one hundred or 
more informative genes is determined. 

15. A method according to Claim 1, wherein the nucleic acid sample is obtained 
from an airway epithelial cell. 



16. 

20 



A method according to Claim 15, wherein the airway epithelial cell is 
obtained from a bronchoscopy or buccal mucosal scraping. 
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17. A method according to Claim 1, wherein the expression is determined using 
a microarry having one or more probes for said one or more genes 
immobilized thereon. 

18. A method of treating a lung cancer comprising administering to an individual 
5 ' in need thereof an effective amount of an agent which increases the 

expression of an informative gene whose expression is decreased in said 
individual as compared with a normal individual, wherein the informative 
gene is selected from the group consisting of the genes shown in Figures IA- 
IN, 2A-2F, 3 A-3D, 4A-4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E. 

10 19. A method of treating a lung cancer comprising administering to an individual 
in need thereof an effective amount of an agent which decreases the 
expression of an informative gene whose expression is increased in said 
individual as compared with a normal individual, wherein the informative 
gene is selected from the group consisting of the genes shown in Figures 1 A- 

1 5 IN, 2A-2F, 3 A-3D, 4A-4P, 5 A-5F, 6A-6R, 7A-7L and 8A-8E. 



20. A method of treating a lung cancer comprising administering to an individual 
in need thereof an effective amount of an agent which increases the activity 
of an expression product of an informative gene whose activity is decreased 
in said individual as compared with a normal individual, wherein the 

20 informative gene is selected from the group consisting of the genes shown in 

Figures 1 A-1N, 2A-2F, 3A-3D, 4A-4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E. 

21 . A method of treating a lung cancer comprising administering to an individual 
in need thereof an effective amount of an agent which decreases the activity 
of an expression product of an informative gene whose activity is increased 

25 in said individual as compared with a normal individual, wherein the 

informative gene is selected from the group consisting of the genes shown in 
Figures 1 A-1N, 2A-2F, 3A-3D, 4A-4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E. 
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22. A substrate having immobilized thereon a plurality of less than one thousand 
polymers which bind specifically to an expression product of one or more 
genes selected from the group consisting essentially of the genes shown in 
Figures 1A-1N, 2A-2F, 3A-3D, 4A-4P, 5A-5F, 6A-6R, 7A-7L and 8A-8E. 



WO 2004/111197 



PCT/US2004/018492 



1/81 





Description 


tyrosine aminotransferase (TAT), nuclear gene 
encoding mitochondrial protein 


putative L-type neutral amino acid transporter 


hypothetical protein KIAA0978 protein I 


polyglutamine repeat; calcium (ca2+) homeostasis 
endoplasmic reticulum protein (ERPROT213-21) 


serine (or cysteine) proteinase inhibitor, clade A 
(alpha-1 antiproteinase, antitrypsin), member 1 
(SERPINA1) 


RNA binding protein 


deubiquitihating enzyme UnpES (UNP) ubiquitin 
specific protease 4 (proto-oncoqene) 


ESTs 


hemoglobin, gamma A 


ESTs, Weakly similar to S65173 probable membrane 
protein YPL162c - yeast 


tubulin, alpha, ubiquitous 


KIAA0326 protein ~l 


gamma tubulin ring complex protein (76d aene) I 


complement component 1, q subcomponent binding 
protein 


potassium voltage-gated channel, Isk-related family 


nuclear RNA export factor 1 


claudin 3 




Map 


csi 

CN 

cr 

CD 


2p22.1 


20q11.1 


19p13.1 


14q32.1 


8p11.23 


3p21.3 


o 


LO 

in 

x — 
Q. 

Y— 


CO 


12q12- 
12q14.3 


16p11.1 


15q15 


17p13.3 


CN 

CN 
CN 

cr 

CN 


CO 
O" 

i 

CNJ 
CT 
x— 


7q11.23 




Genbank 


NM 000353 


AB007896 


AL1 17518 


NM 006387 


NM 000295 


AF251062 


AF017306 


AW025579 


AM 33353 


BG031677 


AL581768 


AI924293 


BF592982 


AU151801 


NM 000219 


BC004904 


BE791251 




Common 


TAT 


KIAA0436 


KIAA0978 


CHERP 


SERPINA1 


LOC84549 


USP4 




HBG1 




K-ALPHA-1 


KIAA0326 


76P 


C1QBP 


KCNE1 


NXF1 


CLDN3 

1 


53003_analysi 
s 


Systematic 


206916 x at 


212217_at 


212238 at 


202230 s at 


202833 s at 


211686 s at 


211800 s at 


213429 at 


213515 x_at 


to 

LO 
CO 
CO 

? 


212639 x at 


213196_at 


213266_at 


214214 s at 


5o 

oo 


I208922 s at 


203953 s at 



WO 2004/111197 



PCT/US2004/018492 



2/81 



CL 
CD 

CD 
CD 
O 



I 

Q_ 

E 

c 
5 

"co 

CD 



J0 
QJ 



o 

CD 
O 



05 



CM 



CM 
CO 



CO 
CD 
CD 
CO 
O 
O 



CO 

CD 
O 
O 



CD 
00 
CD 
O 
CM 



CM 
CO 



LL 

cr 



c 

2 
a 

E 
x 
2 

CL 

1^ 



LL 



CM 

Cl 
il 

0 

>+— ' 

Qj 
CD 



LO 
CM 
O 

o 



0 
o 

CO 

co 
> 



CM 



CD 



CM 
CM 

Q 

CD 
00 
ID 



CD 
CD 
CD 
O 
CM 



CD 

E 
E 

CO 



0) 

to 
c 



CD 

8 

CO 
CO 

a: 



E 
o 

Cl 
CO 



oi 

CD 



S 

o 

CD 

c 

CD 
CD 

c 

CO 

3 

CO 

c 
» 

2 

Cl 
O 

a> 
o 

3 
C 
O 



O ^ 

CO to 



CD 



CO 

o 
c 

E 

CO 



2 xi 
c 

•2 o 

CO '"' 

o, 
^ E 

CO 



CO 



CM 



00 



o 

ID 

CM 



E 

CO 

+— 

i— 

CD 

La 

CO 

o 

<D 



CO 
+ 



O 
Cl 
CO 
c 

CO 



CM 

CN 
CN 

CD 



CO 
00 
CO 
CO 

o 
o 



o 


F7 


< 


CO 


CQ 










H 








CO 


CO 




X 


CD 


CO 


CO 


r— 


CD 


CO 


CM 


o 


CD 


5", 


O 


CM 



CM 
< 
CO 
CO 



CO 



00 
CO 

o 

CM 



ID 
Cl 

3 t- 
O 



CO 



2 

Cl 
0 

3 

O 

o 

I 

c 

0 



CD 
O 

I— 

CD 

E 

CD 

CD E 



lo 

CM 

en 



CD 
CO 

o 

CM 
CM 
O 



CD 


o 


CM 


LO 


CO 


o 


T— 
—3 
-J 


PR 


LL 


CD 


» 


•*— » 


CO 


CO 


CO 


CO 


CD 






CM 


to 


CO 


CO 


CO 


O 


x— 


CM 


CM 



CO 

E 
o 

CO 
X 

o 

L> 

CD 
CL 

c\T 

0 
CO 
CO 

t> 

=3 

o 
>. 

go, 

a) q: 
5 o 



CM 
CO 



CD 



CD 
CD 

o 

CM 
O 



CD 

Q 
O 
t£ 
=) 
LLi 

CD 

c 
o 

■4— » 

5 
c 

2 



CO 

c 
2 

Cl 
0 



0 
CO 

s 

CO 

sz 

CL 
£0 
O 

CL. 

0 
CO 
CO 

E 



E 



CO 
CM 

QJ 



CD 
00 

o 

CD 



CM 

V£L 
O 
LU 
Q 



CO 

co 

CD 
CD 
x— 
CM 



"a 



CD 

T~ 

LO 
O 

O 



CM 
CO 
00 

CM 
# 



& CO 

co *cr 
•c -a 

"D C 
C O 
O SZ 

sz o 
o o 
o .t- 

•1 E 

^ CD 



0 o 
CO o 

is 

>, 0 

CO O) 
< CO 

a: •§ 

*- 3 

>. C 
G - 

co jq- 

^ QJ 



CO 

co 

CO 



LO 
CO 
CM 



CO 

a 



CO c 



0 
2 



CO 

o 
0 

-C 
■»-» 

o 

Cl 



0 

2 

CL 

CD 
C 



2 
J5 2 

15 



E 

CD 
O 

O 
a. 



o 

CO 

g 



CO 



CO 
CO 

i 



CM , , 
CO ^ 

T — I— 

LO 0 



CO 
CO 
-Q 

o 
E 

CO 

_co 
Cl 

f 

or 

£Z 

o 

CD 
0 



CD 
CM 



CD 
CO 
CO 
LO 



CD 
CM 



CO 

o 



CM 
CO 

cr 



CO 

CD 1 
CD 

00 



CM 
Z 

CO 

o. 

N 
LL 

Q co 



o 

CM 
LU 

o 



co 
CO 
CD 
O 

CO 



<r _co £5 

^ (D (D 

E ^ 



QQ 



CM 



00 



00 
LO 

o 

CD 



00 

o 

CM 
CD 

2 



0- 
OQ 
LU 

LL 
LU 





















CO 




a 


1^ 


CO 




CO 


CO 




o 


LO 


LO 




T— 


o 


CM 


CM 


CO 


CM 


CM 





WO 2004/111197 



PO7US2004/018492 



3/81 



X 
CL 



a] 

CM 



© 

o 

Cl 

>> 

-C 



c 
c 



0) 
Q. 

CO 

3 



CO 

E 

0) 
© 



= 1 

CO (- 

sz 

' c 

t- '55 

X O 

co 2 

I— QJ 







CO 




CM 




CNJ 


CM 


CO 


cr 


CL 


cr 


CO 




CM 



CD 

CO 
CO 

© 
c 

CD 

CO 
>> 

o 
o 
c 

© 

CO 

i 

3 



c 
0 

CX) 
CO 

o 



CO 



CO 

CD 

&g 

a) .£ 
> co 

c O 

0) O 



O m 



CD 
C 
0 
O) 



a? 

in 

< jA 
Q 

CD 

•*—» 

5 c ro 



CO 

c 

CO 
*—» 

c 

o 

o 

CM 
CM 

CD 

E 
o 

CO 

o 

E 

2 
sz 
o 



c 
CO 



CD 
c 

CD -p 

8*5 

CD ^ 
CO 
CL 



(D 
CO 

2 
o 

-C 
CL 

CO of 
CO X> 



(D 

4— » 

i 

CO 

Crj 
CM 
CM 



C 
0 

2 

CL 

"co 

o 

© 
o 

Q. 

-c 



jO 

I 

CO 

co" 
c 

CD 

CO oo 
O CM 

P 



CM 
CM, 

cr 



© 

E 
o 

CO 

o 
E 
2 

o 

c 
o 

CO , 

* = 

CO CJ 
CD .Q 

£is 

a> 
c 

O .!= 

E 3 
o - 

*= O 

O <o 
D" CM 

© cr 

CO CO 



CO 
CM 
i 

in 

CM 

cr 



O 
li- 
ra 
*c 

"D 
C 

o 
sz 
o 
o 



c 
€ 
o 

CL 
CO 

c 

CO 



+ 

X 

©" 

CO 
CO 



CO 0) 
CL 



CO 

cr 

CM 



o 
oa 

g 

^< 
CD -J 

=3 O 

£C 

Q 

> 

io cr 

© UJ 



co 
dJ 



CM 
>< 

"5 

E 

© 
> 



© 
a 

2 

© 
c 

c 

E 3 

CO 

© 
c 

I— 

© 

CO 



• CO 
CM 
CM 
O 
O 
O 

o 

00 

_Q 

CD 



I s - 
CM 
I s - 
O 
CM 



C 
'© 

o 

I— 
Q. 

15 

o 
o 



CO 

o 

I s - 

o> 
o 
o 

X 
© 



o 
co" 

in 

CM 
-5 



Q 

o 

CO 

c 
© 
'a. 
co 

co 

o 

E 
o 
X 



IL 

CM 
O 
CM 



© 

CL 
© 

CL 
.>» 

o 

CL 

I 

CL 
O 
JD 

o 
o 
c 
o 



CO 

© 
o 

p 

CO 



CO 

CO 
t— 
CL 
CO 



CM 

in 

CL 



CM 

CM 
v— 

a 

CO 



CO 

CD 
CM 

cr 
in 



CD 

m 
oo 

CM 
O 
O 



CM 

o 
co 
o 
o 



CD 



CM 
CO 
CD 

N 



CO 

CO 
CD 
O 



co 

CM 

co 
co 



CD 

oo 

CO 

o 
o 



co 
oo 

CM 
CM 
O 

< 



05 
CD 

in 
in 
co 



co 

CM 
CM 

o 
o 
o 
o 

m 



CO 
CM 

T™ 

CO 
CO 

LU 

CD 



CO 
CO 



o 

CD 
O 
CO 
O 
O 



X 
CL 



Q 
D 



co 

00 

o 
o 

CM 



CO 

CO 

CO 
r- 
o> 
o 
o 

CM 



CO 
CO 

I s - 
in 
co 
m 

CM 



I s - 
co 

CM 



LL. 



2 

CO 
CO 

"3 
T3 



CO 


■4— • 

CO 


CO 


CM 


in 


CO 


o 


I s - 


o 


m 




in 


CD 


CD 


N- 


r— 




o 


CM 


CM 


CM 



3 

Q 
> 



co 

CO 

co 
oo 

CM 



CM 

a: 
co 



CM 

o 

CM 
"O 



CM 

o 

CM 



< 
CL 

a: 

CO 



CO 
CO 

O 

v- 

co 

00 

o 

CM 



CO 
CO 
00 

in 

CM 
CM 



CO 

I s - 

CM 

o 

CM 

T- 

CM 



CO 
CO 

in 
in 
m 
co 
o 

CM 



CO 

in 

in 
in 
o 

CD 

o 

CNJ 



WO 2004/111197 



PCT/US2004/018492 



4/81 



CO 

c 
o 



o 
CO 
c 

CO 



c 

05 

E 
X 

LO 
h- 

T— 

05 
CM 

CO 
X 

o P 

CD £ 
r= CO 

E 

» o 

•4— i 

co 

.c .> 
.5> o 



CM 
CO 



JO 

E 
CO 2 



0 
C 
0 
CD 

C 



CD 



CM 



C 

0 

2 

I 

-4— ' 

CD 
JC 

o 

CL 
>> 



CM 

a. 

CM 



CO 

10 

CO 
CM 
O 

< 



i 

CM 
Li_ 

CO 



O CD CO ±f 
O — I -J CM CM 

COOQGQCOCOirHS 



CO 
CM 

crl 



CM 

CM, 
Dl 



CO 


^0 


"S 


X 


X 


co 


00 




CT) 


CM 




O 




CD 


N- 


^ 








CM 




CM 


CM 


CM 



O CM 
CD N 



CO 
N- 
CO 
CO 



a. 
m 

CM 

>- 

CO 



CO 
CD 

uo 

r~ 
CM 

.< 



CO 



CM 








CM 


co 


CT 




lO 




V 


m 



c 
o 

CD 

2 
© 

CO 
CO 

> 

CD. 



CO 



oo co 

LO 

D) 
C C 
© C 

.2>'co 

CO o 

■o ° 
$ £ 

CO CD 

a "5 
o £ 
CO cl 

CO 
CO 
I 

CO 

E 
o 
c 

b 



CO 
I 

CM 





CD 




o 


o 


CM 


o 


CM 


CO 


x— 


CO 


T— 


00 


Li. 




< 



CO CO 

° =;= 
I « § 

CD TJ "O 

o c , 

■2 05 sL 

CO _ O) 

Q-oo .g 



Q. 
CD 
O 
CD 



C 

E 

CO 



CO 



c 

0) 

E 

D) 



'CM 



LO 
CD 
CD 



CO 


CO 


C0 | 


CO 


CO 


o 


o 


CO 


LO 












CM 


CN 



to, 

CM. 
CM 

o 

CM 



CO 
CM 

c*1 



c 
-a 
c 

4 

CO 

c 
QJ 
co 
o 

CO 



0 

UP ■ 



CM 



LO 
O 

o 

a> 

LO 
CD 



CM 

0 
o 

s> 

"a 

JD 

Of 
o 

? 
c 
0 

p 



i 

Q-J 

S 
CO 

i 

0 

c 

0 

DD 
w 

S 
8 

CD 



CO 

CO 
CM 



CD 
CM 

a: 
o 
o 

UL 



CM 


*t 




Ql 


CM 




0- 


x— 


o 


LU 


"5 
-J 


Q 


CL 


LL 


or 



to 


■4-» 

CO 


CO 


X 


CO 


a> 


CO 


00 


a> 


CD 


00 


xj" 


o 


X— 


CM 


CM 



00 



CM 



CO 



CO 



co 



1 




CO 




CN 


CO 


LO 


LO 


c\i 


CM 


CN 


T— 


cr 


CT 








o 









CO 

CM 
00 
CD 
CD 
O 
CM 



CO 

o 
o 

CO 

o 
o 



o 

LU 
CO 



CO 

CO 

Cvl 
CO 

o 

CNI 
CD 
CSI 



LO 

M 

CO 

LO 

i 



H 



C0 
CO 

o 

CM 

o 

CM 



WO 2004/111197 



PCT/US2004/0 18492 



5/81 



retinoic acid- and interferon-inducible protein (58kD) 


translation initiation factor 1A (EIF1A) ! 


RAD23 homolog B (S. cerevisiae) I 


histone deacetylase 2 (HDAC2) I 


GAP-associated tyrosine phosphoprotein p62 
(Sam68) . 


pleckstrin homology, Sec7 and coiledcoil I 


integrin, beta 7 (ITGB7) 


YY1 transcription factor 


kinesin family member 5B j 


beta protein 1 BP1 


PC3-96 protein (PC3-96) I 


similar to rat myomegalin I 


putative receptor protein (PMl) 


clone:RES4-24A I 


hypothetical protein MGC5242 (MGC5242) ! 


novel gene from PAC 117P20. chromosome 1 ! 


transcription factor 12 (HTF4, helix-loop-helix 
transcription factors 4) 


hypothetical protein PRO1097 ! 


surfactant, pulmonary-associated protein C 
(SFTPC) 


MUC18 glycoprotein melanoma adhesion molecule 
/FL=gb:M29277.1 


solute carrier family 25 (mitochondrial carrier; 
dicarboxylate transporter) 


KIAA0582 protein KIAA0582 


POU domain, class 3, transcription factor 1 
(POU3F1) 


10q23.33 


X 


9q31.2 


6q21 


1p32 


17q25 


12q13.13 


CT 


10pter-q22.1 


17q21.33 


3q13.31 


CM 
CT 


17p13.1 


4p16.3 


7q31.32 


1q22 


15q21 




CM 

a 

CO 


CO 

CO 
CM 

cr 


17q25.3 


2p14 


1p34.1 


N47725 


NM_001412 


AL527365 


NM 001527 


BC000717 


NM_004762 


NM_000889 


NM_003403 


BF223224 


AF254115 


NM 022488 


AB042555 


NM 003876 


AB000464 


NM__024033 


AL035369 


AL559478 


NM 018499 


NM 003018 


M29277 


NM_012140 


NM 015147 


NM 002699 


RI58 


EIF1A 


RAD23B 


HDAC2 


KHDRBS1 


PSCD1 


ITGB7 


YY1 


KIF5B 


DLX4 


APG3 


PDE4DIP 


• IlAId 




FLJ11000 


MGC9084 


TCF12 




SFTPC 


MCAM 


SLC25A10 


RAB1A 


POU3F1 


CO 
CO 


< 

CO 
CO 


"S 

CO, 


■*-» 

C3 


CO 
X 


■4— * 

CO 
CO 


-♦— ' 

CD 


CO 

co 


"S 

CO 


CO 

to 


•♦-» 

CO 


to 

X 




CO 

in 

CO 
CO 

x— 
CM 
< 


CO 

co 


-*— » 

CO 


«*-» 

CO 


CO 


+-» 

CO 
X 


CO 
CO 


«♦-» 

CO 


CO 
CO 


CO 
CO 


203595 


201019 


201222 


*201833 


201488 


202880 


205718 


201901 


201991_ 


210259. 


220237 


209700 


203437 


220949 


213528 


208986 


221145. 


205982 


210869 


218275 


207971 


208345 



WO 2004/111197 



6/81 



PCT/US2004/018492 



00 

o 

CD 
CD 

to 

CL 

ISl 
LL. 

a 



c 

.2 

V- > 
CD 
O 

-2 

CO 

c 

"5 



c 

CO 
"i— 
CO 

> 

CO 

E 
o 

E 



jo 

O 



o 

E 
o 
X 
ll 

u. 

LU 
Q 



LO 
CD 
CM 
ID 
O 
CL 
LU 
X 

0 
c 
o 
o 

C0~ 



CO 

o 

CD 

in 

CM 

o 
-2, °> 

,fs ™ 

co Q 
3 o 

CO 

c to 

0 c 

CO CD 

O co 



s I 



D) 



UJ co 

a: ~ 

5 

Q 



< 

2: 
a: 

E 
ii 

i±j CD 



or 
CM 



CO 

CN 
O" 
CO 



LO 
CD 
O 

o 

LO 



LO 

CO 

o 

CM 
O 

o 



00 

o 

CD 
LO 

CNJ 

o 

< 



< 

O 
E 

s 

o 

00 



o 






CO 

s 




<!)" 




E 












c 




0 




CD 




.E 












0 




< 




2 








E 


CM 


CO 






M 


1 


e-l 




."2 


ipop 


pept 


o 


>* 


Q_ 


o 


CO 


CL 



CO 




CO 


CM 






a" 


CM 


CM 


D" 


CM 





o 

CO 

CO 
LO 
CD 



CO 



CM 
CL 
CQ 
GO 

£ 

CM 

0 

■4— ' 

o 

Q- 

OJ 
C 

"D 
C 



I s - 
CD 

oo 
LU 
00 



o4 

CM 



C 

0 

2 

Q. 

CD 

m- 
o 



CO 

3L 

Dl 
CD 



O 
O 
CO 
CO 
i LO 
CO 



o 
E 
0 

CO 

Ql 



1 

»— 
Q- 
O 

o 
o 

c 
o 



CO 

0 

o 

Z3 
C 

CO 

O 

0 ^ 

2 £ 



CO 



CO 

c 
0 

4— » 

o 

i_ 
Q. 

OJ 
C 

c 
1q 
c 

I 

o 



c 

a 
s 

Q- 

75 
o 

» 

0 
.1= 

o 

Q_ 

>* 



JO 

E 

to 

>» 
0 

2 

0 



CM 

CjTjcO 



CO 



CO 
CD 

LO 

o 
o 



to 
LU 



CM 



o 

CD 



I 



CM 

CO 

■4— » 

0 

N 

c 

*0 

2 
CM 
c 
o 

U— » 
CO 

c 
o 

0 

"D 

c 

CO 

c 
o 

U- » 
CO 

o 
o 

CO 
CO 



+ 

CL 
Q 

< 
0 

CO 
CO 

c 

0 

CD 

2 
0 

I 



CM 
CM 



•o 

>^ 
H 

g 

I s 

_0 -n 
>* C 
-C 0 

E "D 



CM 

cr 



CD 
LO 
CO 
LO 

o 
o 



CM 

CO 
CM 

CO 



0 

CO 

sz 

CO 

o 

oj 

*g 
'o 

CO 

o 



CO 
CM 
QJ 



CM 
CO 



c 

0 

B 

CD 



CM 
CM 
Dl 
CO 



CD 
CD 

oo 

00 
1 00 



o 

CO 
CO 

o 



Q 
CL 

q: 

X 



Q 

LL 

X 

I- 



< 

GO 



CD 



CO 

CM 

O 
CD 

CM 



CO 
CO 

CM 
O 
CM 



CO 

CO 
00 

T— 

N- 
CM 



CO 

00 
CO 



CM 



CO 
CO 



CO 

LO 
00 
00 
CO 

CM I T ~ 
< CM 



CO 

X 

CO 
CO 
CO 

o 

CM 



CO 

CO 

LO 

CO 
CO 

CM 



CO 
CO 

o 

CO 
CM 
O 
CM 



CO CO 

CO. tt> 



CO 
CO 
CNJ 



WO 2004/111197 



PCT/US2004/018492 



7/81 



CM 
00 

in 



CO 
LU 
II 

3 



oo 
o 

CO 

lo 
CD 
CD 

_Q CO 

CO CM 
CD CM 

o t: 



CO 
13 
(/) 

c 

0) 

to 
c 
o 

o 



o 
to 
o> 

CO 

X 

II 

o 
3 



00 



LL 

§ CM 
LO 

lo g> 

CM ^ 

CO LU 



CD 
< 



CO 

© 

II ^ 

LL ^ 

LU ii 

*.§ 

CD I 

Q O 



CM 



LO 



O 
CD 
< 



QQ 

i 

CO 

CD 
CM 

CM 
< 



(0 



o 

o 

E 
o 
_c 

"co 

CO 

2j 



CO 
CO 

cr 

CM 



CO 
O 
CO 



CD 

z 

CL 
< 

O 



O 

CT> 

c 

'CO 

jfL| 

CO 

o 

CO 
CM 

cr 



CM 



LO 

T— 

CO 
CD 

o 
o 



CD 

z 

0- 

g 



(0 



CO 
CD 
O 
00 
O 
CM 
< 



Q 

o 

CO 

c 

CD 

CL 
CO 
CO 

o 

E 
o 
X 



-1 

si 

o to 

CO CO 

2 -Q 

?i 
It 

.i £ 

^ LL 

CO C 
0) O 
5 CD 

. a> 

IX) 0) Z 

w a: 

co 

° -x 

w 5 CD 

< J5 ^ 
m E 5 



CD 

i ^ 
C = 



o 
o 



< 
z 



co 
c 

9> p 

to 

CO ^ 



o 

-Q 

CO 

9- co 



CD 

O 
CO 
Q 



CD 
CD 
CO 
CO 



O 
CM 
LU 
O 



00 
X— 
CO 

LO 



& =t ?T 



CD 

II 

LL 

LU 

a: 
x 



o 

CO 
Q 



^ CL 



Q 

CD 

o 

CO 

a 



3 CM 
p CM 

S.S 

1*1" 
1*1 

^ t- a 
cm 

0O 03 



CD 

o 



isi 



< V 
xi c 

CM 

CO 



LO 
CM 
CO 

X 
II 

CD 



co 

CM 



CO 

CM 
CM 
O 
CM 
CM 
CM 



CM 
CM 
00 
v— 
CM 
O 



CD 
CO 

O 
LU 
00 
O 

< 



CO 
LO 

o 

CM 

o 

CM 



CM 

CO 
r- 

CD 



CO 
LO 

a> 
o 

CD 



CL 

o 



CO 

oo 

CD 

oo 
o 

CM 



CO < 

CO £ 

~ E 

CO || 



CO 



^3- 

CM 
LO 
CM 

o 



g-lt 

ll 

II _ 

LL ^ 
IX! ° 

^ CO 

CM 
LO 
CM 
O 



o 
0) 
CL CO 

CO 



CO 

B 
II 
Q 
O 

a: 

Q_ 



CD 



.52 CO LO 



CM 
LL 



_Q 



CD 



CD LL 
CO LU 



II o 



Z ° 

-Q Q. 
O) CO 



cd 



% ° 

X s 

ULI CQ CD 

" 5 & 



CM 



cr 
CM 



5 

CM 
LO 
CM 
O 



3 

CO 



CO 

CO 
CM 
CO 
O 
CM 
CM 



WO 2004/111197 



PCT/US2004/018492 



8/81 



JCD 
CO 



CO 

E 

, o 

"O 
O 
CD 

E 
o 



o 

.CO 



o 

*i— 
O 
CO 

c , 
I CO ' 



lo 

CNJ 



! 

15 

.0 

">4— » 

0) 



CO 



CO 



01 

LO 



CO 
CM 

cm 



CM 
O 
CM 
CM 
O 
O. 



I CM 
I J£ 



_o 

O 



«+-« , 
c 

ii 

o 
o 

0) 

E 

°, 

CO 

x 

0 



CD 
CO 

5 

0 

E 
o 

CO 

!>» 

2 
a 

^+-» 
oj 
0 

CL 



2 

» 

CO 

•4— < 

CO 

o 
o 
c 
o 

0" 

CO 
3 

8 

D 
"O 
CO 

c 

CO 



CO 

c 
g> 

CO 
CD 
C 

14 

cu 



to 
0 



0 



a 

o 

0 

a: 



I CM 



CM 
CM 



CM 



CO 



CO 

o 

s 

o 

CN 



CT> 
LO 

O 

o 

LL 

CQ 



LO 
LL. 10. 



CO 

CO 

" x 

,co 

^ CD 

cop 

CO CO 

T- O 

0> CM 



CD 

LO 
CO 
LO 

00 
LL! 
00 



CO 

LO 
CM 

01 

I 

CM 

LO 
CM 



co 

CO 



CO 

°>, 

CO 
CD 
00 

o 

CM 



CO 
CO 
LO 
CO 

o 
o 

CQ 

< 



LO 
-J 

a 
o 

LU 

tr 



c 
0 

c 

CO 

"a 

0 

o 
o 

CO 
CO 
CO 



0 
SZ 

s 

CL 

LU 





CM 




CM 


CO CO 




CO rfr 




v- CM 


CM 


Q. CT 


C7 


cm 


O 







CO 

tfjco 

I LO 



CO 
CO 

o 

CO 



CM 



CO 



c 

CO 

m 
0 

-D 



0 



.CO 
CO 

LU 



I 

a 
c 

*co 

E 
o 



c 

T3 
C 

aj 
O 

sz 
■4— » 

0 
E 



CO 



CO 



CM 



CQ 



CO 



CO 


se, 


E 


E 


CO 


CO 


ogen 


o> 


~o 


nase 


dehy 


roge 


yde 




sz 




0 


sz 




ep 


yral 


0 




ehyd 


inobi 


■0 


E 


CO 


CO . 



CO 
CM 

dl 
1 

CM 
CM 
C7 



00 



CO 



CO 

o> 

CO 

o 
o 
o 



CO 
CM 



o 

CM 



CO 

Cl\ 
NT 
LL 

y: 

Q 

c 

2 
p 



o 

CL 

sz 



CM 

§ 

CQ 
c 

2 
2 
04 



CM 

cvi 

CO 



3 

Q 



CO 

CM 
CD 

o 

CM 



LO 

CM 
O 
CM 
O 



o 

CM 
LL 

T — 

CD 
CL 

E 



CO 

o 

CM 

00 
00 

V 



CO 

0 

£2 

CL 
Q 



-Q 

CO 
CL 

E 
o 

O QQ 

3 cl 

CO Q 

.ax 

of ^ 
E 



CO 



CM 

04 

CD 



CM 

T — 

CM 
O 
O 



DQ 

QL 

Q 

1 

5 

X 



0 
-Q 

E 
o 

Q_ 

CO 

0 

"T 
LO 

0 

O 

& 

c 
o 

CO 

> 

0 
O 

LO 

O 
Q 
O 



CM 
CO 



D5 

CO 
CM 
"3 
_I 

LL 



C 
0 

2 

CO 

o 
0 

o 

CL 



CD 
O 
CD 
CM 
CM 

C 
0 

o 

0 
^ CO 
CL Q- 



CO 
O 

'•+— » 
<D 

SZ 

z 

CL 



CD 

8 

c 

CO 

o 



CM 
O* 



CO 

LO 
CM 
CT 
I s * 



"co 


CO 


CO 


COf 


0 




CM 


CO 


LO 




O) 






0 


CM 


CM 



CM 

o 

CNI 
CNJ 
O 



00 

LO 
CM 
CD 



N 



en 
co 

CNJ 
CNJ 



DO 
LO 



LL- 



CS 

G) 
x— 
CM 



CO 

CO 

00 
LO 

OD 

CNJ 



CO 
LO 

co 

LO 



WO 2004/111197 



PCT/US2004/018492 



9/81 



TO 
CD 
_C 

ZD 
< 



X> 



(0 
"O 
X> 

E 
m 

< 2 

c ^ 

I" 

3 C 

E « 

.§■§ 



CO 
CQ 

CL 
CO 

iS 

0) 

'c 
o 

Q_| 

E 



CNJ 
CQ 



CD 
Q. 

& 

CO 



Q 

GO 

c 
> 

2 



CO 

g 
*c 

'CO 
C 

o 
o 



o 
E 

JB 

CO 

dJ 



o -8 o 
E lW 



o 
x 
n 

LL 
LU 



CM 

CO 
00 
CM 
00 

T- 

o 

CQ 

< 

-Q 
CD 

CO 

CD 



I s - 
O 



II 

a-* 
ii 

c Q 

'55 O 

u 

o o 
o Q 



IIll 



o 
c 

CO 
n 
co 

£Z 
CD 
CO 

c 

o 
O 



i< 
fc a: 

CO =t 



CD 
CD 
O 

m 

CO 

X 
II 

CD 

I s - 
o> 
m 
o 

CD 



II £ 

lu aj 
LU 

3 

Q 



CD 

c 

0) 

D3| 

O 
O 



o < 



CO 

.2 
*«+-» 

© 
x: 

"5 

CL 
>» 
XT 

CO > 
CD Z 

■q-q: 

CO c 

-co c 



CO 

o 

a. 



CNJ 
00 

O 



o 

E 
o 
X 
ii 

LL 
LU 
Q 



CM 
CM 

a> 

CO 

°U> 
II 

LL. 
LU 

a: 



-o Jj 



I 

CL 

"co 
o 



CM 
T CO 

o o 

If 

O) CL 



LL 

II ^ 

z 

LLi CM 



CD 



OS 
E 

LLi 



IS 

O 

CL O 
>% t- 
-C 00 



I- o 

CO JC 

LU CM 

ii ^ 

LL "S? 

^ CD 
II 



00 
00 

o 

CO 



LU 
CL* 

< Q 



CO 
CD 

3 



QL 
CD 
Q. 

lO 
CD 
CD 

CO 

X 

It 

CD 
13 

O) 
CM 
LO 
CO 
h- 
00 



X> 

O0| 

II 



CO <M 

00 LU 

' - '5) < 
ii ^ 



LU II 

d: uj 

CQ _l 

Q O 



o 
o 2: 



X> 
CDj 



CO 
13 
(0 

c 

CD 
W 

c 
o 

a 



CD 



CD . 

52 co 

co co 

S o 



— x* 

>> II 



CM 
t- 

(D 

E 

CD 



LL 

a 

s 

cm" 
E 

CO 



Q. 

a 

CD 



CO 
CO 

CM 
CO 



CM 

CM 
CO 
D* 

CO 



CM 
CM 
CL 
i 

CO 
CM 
Q- 
CM 



CM 
CM 
Or 
O 



CT1 



CO 
CM 



CD 
CM 

cr 

LD 



in 

CO 
LO 

in 



CM 
CM 
O 

o 



2 

X 

CD 



CO 
CQ 
Z 
CL 



N 
CO 

O 



CQ 



CO 

co! 

CM 
CM 
O 
1^ 

CM 



CM 



CO 
CM 

S 

o 

CM 



CO 
lO 

a> 

CM 
LO 

CM 
< 



CO 
00 
CM 
CO 
v— 

o 

CQ 
< 



CO" 

O 
X 

a: 



CO 



in 

CD 
CM 

CM 



O 

00 
v— 

o 



CM 

o 
o 



CM 
CO 

o 



CO 
CO 
CD 

T~ 

00 
CD 

CM 



CD 

CL 
0- 



CO 

CO 

CO 
CD 

a> 

00 

o 

CM 
< 



LO 
CD 

CO 

o 



CM 
LL 
CM 

a: 



CO 



CD 
O 
CM 



WO 2004/111197 



PCT/US2004/018492 



10/81 



Consensus includes gb:AI638762 /FEA=EST 
/DB_XREF=gi:4690996 /DB XREF=est:tt32a04.x1 
/CLONE=IMAGE:2242446 /UG=Hs.77965 peptidyl- 
prolyl isomerase G (cyclophilin G) 
/FL=gb:U40763.1 qb:NM 004792.1 


kangai 1 (suppression of tumorigenicity 6 ( 
prostate; CD82 antigen (R2 leukocyte antigen, 
antigen detected by monoclonal and antibody 
IA4)) (KAI1) 


hypothetical protein (FLJ 10007) i 


serumglucocorticoid regulated kinase-like fSGKL) 


karyopherin alpha 2 (RAG cohort 1, importin 
alpha 1) (KPNA2) 


heterogeneous nuclear ribonucleoDrotein R I 


HSPC125 protein (HSPC125) 


BRCA1 associated protein-1 (ubiquitin carboxy- 
terminal hydrolase) (BAP1) 


embryonic myosin alkali liaht chain (MLC11 ! 


interleukin 16 (lymphocyte chemoattractant factor) 
(IL16) 


hypothetical protein FLJ 12567 


cytochrome c oxidase subunit Va fCOX5A^ 


hypothetical protein FLJ21062 


G protein-coupled receptor, family C, group 5, 
member B (GPRC5B) 


ribokinase (RBSK) 


transcription factor 
















CO 






CM 






























8q12.3-8q13. 


CO 
CM 

cr 






T— 

CM 

a 


*> 


















T— 

CO 

cr 

CM 






11p11.2 


1p32.3 


17q23.1- 


1p36.11 


6q16.3 


3p21.31- 


17q21-q1 


15q26.3 




LO 
LO 


7p11.2 


16p12 


2p23.3 


1p22.3 




004792 






002231 


019073 


013257 


002266 


101449 


.014165 


004656 


CM 


004513 


024714 


004255 


024788 


016235 


022128 


03600 




z: 






NM 


NM 


NM 


IAIN 


BCC 


NM 


NM 


M36 


IAIN 


z 


z 


NM 


IAIN 


z 


BCO 


PPIG 




SPATA6 


SGKL 


KPNA2 


HNRPR 


HSPC125 


BAP1 


MYL4 


IL16 




COX5A 


FLJ21062 


GPRC5B 


RBSK 


LM04 




«♦-' 
<° 

CO 






to 


■Jo, 


« 


to, 


to 

0) 


•4— ' . 




to 

x; 


CO 
CO 


-»-» 


to 

to 


S 


■4-* 

CO : 
CO 


CO 


CO 
CO 




208994 






203904 


220299 


220038 


201088 


99Z80Z 


9006 IZ 


201419 


8800 IZ 


209827 


m 

<— 

3> 
r- 

CM 


CO 
CO 
CO 
CO 

o • 
CM < 


m 

LO 

j> 

N 


203632 


rsi 

CNI 
"C — 

CVI 


LO 
O 

O 



WO 2004/111197 



PCT/US2004/018492 



11/81 



small inducible cytokine A5 fRANTESl f.snYAm 1 


period homoloq 1 (DrosoDhiia) 


KIAA0480 gene product 


ESTs, Highly similar to SRA4JHUMAN CTD-binding 
SR-like protein RA4 


capping protein (actin filament) muscle 7-line h<*ta I 


chemokine (C-C motif) liaand 5 


CD2 antigen (p50), sheep red blood cell receptor 
(CD2) 


KIAA0924 protein 


testis specific leucine rich reoeat nrntein rrsi RP^ 


phosphatidylinositol alvcan. class O rpmoi 


kinectin 1 (kinesin receotor) 


hypothetical protein FLJ20531 


thyroid hormone receDtor interantor 13 rrRiPi^ 


deleted in malignant brain tumors 1 (DMBT1), 
transcript variant 1 


glutamyl-prolyl-tRNA synthetase 


NUDEL; endooliaoDeDtidase 


CD 

s 

■ Q 

O 
- -C 

)§ 

o 
sz 
a 

CL 

< 

o 

I 

o 


signal transducer and activator of transcription 1 
91kD (STAT1) 


bromodomain containina 4 


3 


hypothetical protein FLJ22489 


protein kinase C, delta binding protein 


17q11.2-q12 


17p13.1- 
17p12 


1p36.13-q41 


CN 


1p36.1 


17q11.2-q12 


1p13 


17q22 


8q24.22 


16p13.3 


14q22.1 


1q44 


5p15.2 


10q25.3-q26.1 


1q41-q42 


17p12 


15q15.3 


2q32.2 


19p13.1 


13q13.3 


7q21-q22 


11P15.4 


NM 002985 


AF022991 


NM 014810 


AA648521 


U03271 


M21121 


NM 001767 


NM 014897 


NM 012472 


NM_ 004204 


BF589024 


NM_017865 


NM 004237 


NM 004406 


AI475965 


NM 030808 


BC003418 


NM 007315 


BF718610 


NM 005830 


AK026142 


AI088622 


CCL5 


PER1 


CAP350 




DQ 
N 
CL 
< 
O 


LO 
I 

o 

D 


CD2 


KIAA0924 


TSLRP 


a 

0 


z 

u 

* I 


CO 
LO 

o 

CM 
□ 

x j 


CO 
r— 

CL 

DC 
- 


DMBT1 


EPRS 


_i 

D 
2 


a> 
i 

CL 
QL 

X 
< 


STAT1 




CO 
CO 
CL 
K l 
? I 


CD 
< 


PKKCDBP I 




CO 


c3| 

CO 


+-> 

CO 

w 


fi ' 


■♦— ' 

CO 


CO • 


+-* 

C0 | 


■4— ' 

CO 


CO 

CO 


s - 

X 


to 

CO ( * 


ro 


"5 
w 


4-* 

CO ■ 

co | 


8' 

CO 


CO 
CO 


-4-» • 

CO 
CO 


(0. 
CO * 


CO 


co ' 


CD 


204655 


36829 ; 


ro 
<. 
r> 

O 

N 


222311 


N • 

li 

O ' 
t i 


o 
z> 

— 


T — 

CO ' 
OO J 

8 

?\ 


35 
O 
O 
D 
N 


206483 


r- < 

C 

D C 
N C 


r- < 
35 C 
D C 
D C 
M C 


r- ( 

O ( 

o c 

M C 
N C 


o 
o 

ZD 
3" 
D 
N 


208250 


CN , 

St 

00 J 

8 


O ( 
3> < 
Z> < 

o << 

M C 


N 
r- 

3 


200887 


N ' 
D C 
— C 
M C 
D i 
N C 


o r 
— ^ 

NJ C 


» ( 

^ ( 

*r c 

NJ < 


c— 

~o 

— 



WO 2004/111197 



PCT/US2004/018492 



12/81 



T-cell receptor aberrantly rearranaed aamma-chain 


API5~like 


ribosomal protein S20 


thyroid hormone receptor-associated protein 
complex compohent TRAP240 mRNA. 
/FEA=mRNA /DB_XREF=gi: 10435894 /UG=Hs.4084 
KIAA1025 protein 


ribosomal protein L35a 1 


RNA binding motif protein 8A 


protein phosphatase 1 f regulatory subunit 7 


lactotransferrin (LTF) 


KIAA1096 protein 


hypothetical protein MGC35048 


KIAA0227 protein 


tumor necrosis factor (liaand) suDerfamilv. memhw m 


coatomer protein complex, subunit atoha 


epidermal growth factor receptor (erythroblastic 
leukemia viral (v-erb-b) oncoaene homoloa. aviarrt 


ADP-ribosylation factor-like 5 (ARL51 


tumor necrosis factor (ligand) superfamily, member 
10 


ralA binding protein 1 


KIAA1096 protein 


7p15 


CM 
t~ 

O" 
i 

CM 

a. 


8q12 


12q24.21 


i_ 

CD 
•♦— » 

a* 
i 

o> 

CM 

cr 

CO 


CM 

cr 


2q37.3 


3q21-q23 


1q23.3 


16p13.13 


CM 

cr 
^ > 

"C — 


to 

Csl 

cr 

CO 


1q23-q25 


CM 
Q. 


2q23.3 


3q26 


18p11.3 


1q23.3 


M13231 


NM 006595 


AL576654 


AK023837 


AW517686 


AI738479 


BF718769 


NM 002343 


AW238632 


AW070229 


D86980 


AW474434 


AI621079 


AW1 57070 


NM 012097 


NM 003810 


AW025261 


AL096857 


TRGC2; 
TCRGC2; 
TRGC2(2X); 
TRGC2(3X) 


API5 


RPS20 


i 

KIAA1025 


RPL35A 


RBM8A 


PPP1R7 


LTF 


KIAA1096 


MGC35048 


KIAA0227 


TNFSF10 


VdOO 


EGFR 


ARL5 


TNFSF10 


RALBP1 


KIAA1096 


215806 x at 


201687 s at 


CO 

w 

CO 
CM 
CM 
CM 
v- 
CM 
< 


212208_at 


212136_at 


2141 13_s at 


213465 s at 


C0 
O 

s 


« 

X 

LO 
LO 
O 

CM 


CM < 

CO 
CO ( 
r • 
CM < 


^- 
ro 

r- 

N 


214329 x at 


"co 

i-» 
:o 
ro 
si- 
r— 
N 


201983 s at 


218150 at 


202688 at 


202844 s at 


1211948 X at I 



WO 2004/111197 



PCT/US2004/018492 



13/81 



O) 

c 

E 

x 
o 



O 

c 
o 



o 

CO 

c 

CO 



CO t- 

J22 X 

O L 



c 
o 

0 



c 
"5 

o 

I 

CD 
_Q 

"D 
0) 
O) 
C 

5 

CO 
CD 



CD 
O 



0 

o 

h 1 O 



c 
o 



o 

CO 

c 

CO 



CO 

> 

CO 
T5 

c 

CO 
I— 
0) 

o 

TJ 

CO 

c 

CO 

— CO 
CO Q 

*CO O) 



c 

C 



CM 

rsl 
X 

S 

oo 



c 

CD 

2 

Q4 

l— 
CD 



I 

Q-l 

CO 

CD 
O 

I 



CD 

c 
c 

CO 
3 

DC 



3 

cr 
la 

3- 



3 

E 

0) 

c 
0 

CD 
CO 



CN 
. . O 
CO O 

.E a 

CD 

2 co 

Q-X 

CD _ 
O) CD 

■M | 

.E co 

O CN 

•t E 

£ i 



s § 

03 co 1 

0 CO 

> o 

3 i 

3 O 
CL 



E 

*C0 

o 

I 

CO 

E 
E 

CO 
O) 

1— 

o 

» 

0 

o 
0 



8 



c 
o 

CD 
CO 

o 

■4— ' 

o 

*D 
CO 
0 



o 

CO 
CO 

n 

c 
a 



o 

E 
o 
X 
II 

LL 
UJ 
Q 



0 
c 
o 

o 



< 

t- Q < 

CD°Z 

o tr a: 
^- 0 c 

g I 

< 5 u. 

■Q c ^ 
°> 0 o 
co 5J2 

(D — ^ 
3 CM 
CD 

.E Z til 
K CD 
E < 

col 

ii 

CO — * 
co Hi 



o 

E 
o 
X 

a> 

CD 

h- 

CN 
CO 

CO 

X 
II 

CD 
co 

CM 
00 

co 
oo 

CD 
ID 



0 
C 

o 



Q 

o 

0 

CO 

c 



O) 

c 

0 



o 

CO 
.XT 
CM 
CO 
CO 



< 

OD^ LU 
CD 

LU 

a: 



co 

i§ 

CO LU 



CO 

cr 



CO 
CO 

c\i 
co 
oi 



co 



CM 

CM 
CO 
O" 
CM 



CO 

CO 
CM 
Q. 
CM 



CM 

al 
co 



CM 
D" 
i 

CM 
CM 

dl 

CO 



CM 
CO 



OO 



31 
CM 



CL 



CM 
CM 

cr\ 
co 



CM 

>< 



to 

CM 

7 

CM 
Cr 
LO 



o 
in 

CM 

o 

o. 



CD 

in 



in 

CO 
CD 



CD 

in 
m 

CO 

o 
o 
O 



CM 
O 
CM 
CO 
CD 



CO 
CO 

h- 

CM 



o 

00 
CM 

H 



CD 
O 

CO 
O 



CM 
Li. 
Q 
O 



X 

CD 



CO 



N 



CL 
(0 



in 

CD 

a 

LU 
OS 

a. 



CD 
DC 



O 

o 



co 
to 
in 

00 
LO 
CM 
O 
CM 



CO 

X 

in 
o 

v- 

CM 
< 



CO 



CO co 

X ^ co 

£ S N. 

< X 2 



co 
o 

CM 



CO 

CD 
CD 

CD 
r— 
CM 



CO 

CO 

O 
CM 
CD 
CD 



CO 

co 

00 

CD 
r- 

CM 



CO 
CD 
CD 

in 

v— 

CN 



WO 2004/111197 



PCT7US2004/018492 



14/81 













(B56] 






tein 




:ory subunit B 




(FLJ2151-h 


associated prol 


r 


protein phosphatase 2, regulal 
gamma isoform (PPP2R5C) 


interferon, alpha 16 (IFNA16) 


hypothetical protein FLJ21511 


syntrophin, beta 2 (dystrophin- 
A1, 59kD, basic component 2 


KIAA0683 gene product 








CO 
CM 

cr 


CM 


CM 
Q. 
CO 


9p22 


4p12 


16q22- 


16p13. 


_002719 


002173 


025087 


006750 


14583 


IAIN 


IAIN 


NM 




ABO 


PPP2R5C 


IFNA16 


FLJ21511 


SNTB2 


KIAA0683 | 


■b 

CO 


4— ' 

CD 
X 


*-> 

CO 


to 

CO . 


03 

X) 
N 

^O 


201877 


DO ' 
<fr < 
I 

X) ( 
3 ( 
N ( 


sr 

N 

"s. 

D 
N 
N 


205315 



WO 2004/111197 



PO7US2004/018492 



15/81 



s 

CM 

<D 
CO 

o 

& 

Q-| CO 



I 

CD 
CL 

CO 

<i> 
c: 

.o 

E 
a) 

E 
to 
c 

CO 



0 
c 

CO 

V— i 

-Q 
E 

CO 

E 

0) 
_Q 
CO 
-Q 

2 

CL 
CO 

So 

CO 
CO 
Q 



2 

CL 

c 

2 c 

CL CD 

CO c 

1 <° 

- 

E "3 

c o 

CD CO 

CL CO 

V) CO 



+ 



o 
a 

c 
to 



>* E 

■+-» 
g co 



to 
a> 
■o 
o 

£Z 
CL 
CO 
O 

sz 

CL 

s 

CO 
JZ 
CL 
10 

o 

CL 



45 



< 
CN 



-9 
c 

<D 
CO 

f 8 

cl c 

CD *~ 

J CO 

>* 
o 



< 

CNI 

Q 

O 



in 

CT 



CL 

o 



CM 
CO 

7 

CO 
CM 
CT 



CM 
CM 



CO 



CT 



CO 

CO 
CM 

cr 
o 



co 
co 
co 

CO 

o 

< 



CO 
CO 
CO 



in 
co 

CO 
CO 

3 



a: 
to 

CL . . LO 

. . T- CO 

oc a: jo 
co co 2 
dq5 

H H S 
E CL * 



CM 



CO 

CO 

CM 
CM 
h- 
CM 



lO 



3 

CO 

s 

o 

CM 



CO 



CO 

en 
co 

CO 

8 



< 

CM 

Q 
O 



co. 



CD 
CO 

o 

CM 



WO 2004/111197 



PCT/US2004/018492 



16/81 



1 rhodanese, 1 


KIAA0121 protein 


sulfatase FP I 


|DKFZP434J21 4 protein 


Huntingtin interacting protein B ! 


ribosomal protein S20 I 


GABAnoradrenaline transporter solute carrier family 
6 


hqp0376 RNA helicase-related protein 


Homo sapiens BX1 mRNA. partial cds ! 


T-cell receptor rearranged beta-chain V-region (V-D- 




prostate differentiation factor I 


zinc finger protein 323 


ribosomal protein L22 


hypothetical protein MGC3178 


prostate epithelium-specific Ets transcription factor 
(PDEF) 


hookl protein (HOOK1) 


carbohydrate (N-acetylglucosamine 6-0) 
sulfotransferase 6 (CHST6) 


hypothetical protein FLJ23514 iTLJ23G1/n 


hairy/enhancer-of-split related with YRPW motif 1 I 


hypothetical protein FLJ 1 2442 "1 


u 
c 

CD 

CO 

I s 

3 

E CD 

•> m 

hB 
s o 

i* 

— C 
1) O 

S8 

2 o 
= $2 

Z CO 


EST, Weakly similar to 2109260A B cell arowth fantnr 


highly similar to AF1 19857 hypothetical protein 
PRQ1855 


Tho2 


peroxisomal biogenesis factor 14 


ESTs, Weakly similar to hypothetical protein FLJ20489 


CO 

cr 

CM 
CM 


|3p25.2 


|8q13.1 


|3q26.1 


|3q22.3 


|8q12 


Xq28 


17q22 




14q32.33 




CM 
CO 
1 

X— 
CO 

Q. 

CO 


CO 


CM 

CO 
CO 

CL 

i 

CO 

CD 
CO 

o_ 


CM 

id 

CM 
CL 
CD 


6p2!3 


CM 
CO 
CL 
x— 


16q22 


cr 


8q21 ] 


3p21.31 


6p21 


15q24-q25 


CO 

CO* 

CSJ 

cr 


CO 

CD 
CM 

7 

LO 
CM 

3 


CM 
CM 

CD 
CO 
Q. 




D87292 


ID50911 


IAW043713 


AL556438 


AI761110 


AL576654 


U17986 


AF078844 


BE867771 


M15564 

■- — 


BE250348 


AF003934 


AW086021 j 


BG1 52979 | 


o 

co 
o 

CO 

o 


NM 012391 


CO 
00 
CO 
lO 
v— 

3 


lO 

T— 

CD 
CM 

5 


r- 

CM 
CO 

T- 

CM 


R61374 


D87433 


AI524138 


AL1 09698 


AK025328 


BG403671 


AB017546 I 


AI004009 


TST 


8 


KIAA1077 


CM 
CO 

2 

SI 
LL 

Q 


CQ 
CL 
>- 
X 


o 

CM 
CO 
CL 

X 


SLC6A8 


RNAHP; RHELP; 
SF3M25 




IGHM 




dq 

S 

a. 


CO 

cm 
ro 

_L 

z 

si 


:m 

1. 


CO 

N- 

ro 
O 

^ 


PDEF 


r- 

o 

0 

r 


CHST6 


lO 
ro 
CM 

3 


l _ 
>- 

S! 


-L U 


TFEB 




PR01855 


CM 

o 

X 
— 


X 

a. 




ro 
to 

o < 
CO ( 
CO < 
O ( 
CM i 
=tfc C 


CO 
CO 

J) 

7> \ 
o < 
N ; 

N I 


CO- 

^ i 

-O < 

ro t 
N < 

c— ^ 

"M ( 


*-» 

o < 

D ( 
D ' 
N < 
c— i 
M < 


GO 

5! 


*—> 

•l 

5 

N 
N 
N 
— 


210854 x at 


3210524 x at 


CO 

CO 

b 
j> 

i— 

>J 


CO 

£ < 

CO C 

o r 

Pi 


\~* 

CO 

o 

"M 

51 


v. ^ 

o c 
r- < 

"M C 
N C 


S |* 
"l 

O I 

- r 

d r 

*M i 
"M C 
N C 


S ' 

X 

O C 

^ I 
^ c 

M 1 
M C 


i— > 

CO 

CO 

o 
o 

N 

M 
M 


220192 x at 


B 

D 

3) 
7) 

M 


CO 

o> < 

■ lO ( 

o c 

T~ ( 

CM C 
CM C 


5' 

x> c 
o c 
d r 

M " 
M " 


t— • 

05 

o r 
o c 

^ c 
T c 


»— < 
to 

^ CO 

o 

3" CO 
O CM 
O CO 


o 

w | 


37590_g_at 


"l 
W | 

T" < 

CO c 
CM i 
CM C 
CM C 
CM C 


CO . 

"M 
"N ( 
— ( 
N \ 
N < 
*M t 


>— < 

5 

D 


222288 at 



WO 2004/111197 



PCT/US2004/018492 



17/81 



CO 
CO 



UL 

c 

B 
2 

CL 

s ^ 

Q- — I 

>»LL 



C 

cd 
2 

CL 
CO 

T— 

I 

>;*: 



t 

CD 
CD 

<D~ 
C 

a 

P 
CL 
CD 

CD 



o 

CL 
CO 



0) 
CO 

r: 

CL 
CO 
O 
JZ 



(j) CD 

IS I 

o co 



c 

a) 

2 

CL 

a) 
o 

— 
CO I 

in 



co 
o 

X It- 
cd'*" 

§1* 

LL o 

Q < 



si 



C 

£ CD 

ill 



CD 

CL 
>> 
O 5= 
CL CO 

cd CD 
E <° 

CD 
(O 



to 
c 

I 

% E 

CO 03 

to 

^ o 



CO 



CL 

co 

J. 

2 

CD 
O 

c 

CD 

.c 
c 
CD 



cr* co 

T5 C 
C O 
O -C 

JC O 

o o 

II 

CD O 

to y 
iS g 

CO O 

I— 

CO 

CD 

(Z o 

3 



<2 CM 

fc w a. 



° 
i_ i 

CL 

-J* CM 

S 5 

p-Iq 
x 



CD 

E 

.2 

|cT 

CD 
■ w 

| CO 

3|£ 
-Q =3 
3 CO 

.Elf 
co E 
o 

8 s 

S£ 

5.1 8 

O Q 

co !£r 

. w 



S e 
« Lb rrt 

I co CO 



I 

0)1 c 

is r" 

CD 



5 E 



a: .£ 



CO 
LL 

CO 
i_ 

o 
o 

c 

o 



3 



o 

15 



o 

JO 
CO 

o 



c 

CD 

•o 
c 

CD 
CL 
CD 



P 



CD 
C 

a) 
m 

CO 

o 



CD 
(0 



CD < 



CO 



CO 



CL 



CM 
CT 



CM 



CM 

cm" 

CM 
CT CL 
CM 



co in 
co 



CL 

i 

ro 



CT 

CM 



CM 
cr 



co 



aJ. 

lOr 
CO CL 
OJO 
t- CM 



CM 



CM 



on 

CD 



CM 
CO 
CM 
CM 
CL 

X 



5 K 

• CO 

co • 

CO 



CL I 
CD 



CM 
cr 



CXD 

CM 
CM 

'3 

•2. 



CO 

co 

CD 
O 
LL 
< 



CO CD 

a> o 

03 O 
t— CM 

co|| 

<l< 



co H- 
co in 

CO CD 

sb 



O 

o, 
2 



CO 

m 
cm 

CM 



5S 



2 H 



< 



CD I 



(ON 
CM CD 

'co 

s 



CM 

oo 
^3- 



co 

GO 
CD 
O 
O 



CD 

LU 
r- 



Q_ 
CD 
O 

. - CD 

CJ rr 

CL Q 

CD W 

CD CD 



CM [r- 
CL 



CO LL 



CD 



in 

-|| 

— 

Ll jO 



CO 

I 



CQ 
LL 



< 
CL 



ra l 






«*— « 




CD 


CM 


CO 




m 


-r- 


co 


CO 


h- 


x— 


T — 


CM 


CM 



o 1 

CO 
CM 



CO Lw 

in 1 
co in 
h- m 
m n. 
m 

CM k 
CO CM 









CO 


3 


CM 


CM 




a> 


o 


CD 


CD 






? 







C0.U-. 

CD ""I 

m | co 



CD 

in 
co 
co 

CM 



O) 
CO 
CO 
CO 

CM 



•J. 

in co 
a> m 
co co 

CO CO 
CM I CM 



10, CO, 



to 
r-. 

CD 

co 
o 

CM 



CM 
O 



CO. 



CO 
CD 

S 

CM 



CD 

S 

CM 



WO 2004/111197 



18/81 



PCT/US2004/018492 



to 
CO 

c 

XL 

c 

Q) 

e 

CLj 

LU 
< 



CD 



cu 

to 

to 

o 
n 

Q. 

CO 
O 
JC 
Q. 

0) 
to 

CO 



Q. 

CO 

o 

CL 

s , 

Q- o 

CD *f= 

s g 

O £ 
o § 

si 



CD 



O T"~ 

< oj 

B% 
2 8. 

.£ o 
c 



-Q 

to c 
o *= 
o 

o CO 
O Q- 



V) 



X 

o 



s 



0) 
CO 

o 



CD 
CO 
CO 
C 

B 
2 

Cl 

<D 
C 

S 

"i 

o 5 



E 
2 

o 

>i d) o 
o w 



eg 

CJ)j-D 



>v 

■o 

E 

C 

CO 

<D 
,> 

1 

i 
2 

CL 



CL 
GQ 

o 

c 

0) 

2 

CL 

D> 

E 

TJ 
C 

JQ 

< 

a: 

0) 
_Q 

O 
3 

■o 

C 

8 



II 

c 

jg CO 

s I 

— a. 

>* - 

1 « 

ii 

3 o 

CO o 



<D (D 

2 Q-CQ 

SZ CD x- 

o cl a. 
o >>> 

£££ 



8 

0) 

i 

c 



C 
(0 

o 
'c 
o 
c^ 
.cj in 

I a 

O 3 
C O 

o ® 

co 2 
u E 



CD 



CO 

m OJ 
.2 CO 
TJ C 

0J o 

i » 



II 

o 
w 

CO 
CO 



CO 

a) 
a. 

i_ 
a> 



3 

.g* 

3 



05 
O 
O 

E 
o 

-C 

to" 
c 

CO 
CO 

a> 
O 

CL 
< 

z: 

CO 
CL 



CD 



CO 



CO 



CM 



Q_ 
CO 



CM 
D" 



CO 

T — 

cr 

CM 

T— 

cr 



CD 
CO 
CO 

5 
o. 



in 

CM 

cr 

4 

CM 

X 



CM 
CL 
CM 



CM 
CO 

o 
o 

CM 



CM 
CO 

CO 

cr 



CO 

CM 
CL 
CD 



CO 
CO 



CL 
CD 



CM 

CM 
t— 
O" 
CM 
CM 



CO 
CD 
CO 

o 
o 



< 
o 



CL 
CO 
=3 



Ql 
< 

-z. 

CO 



00 
03 

O 

3 



00 
CO 

o 

CM 



WO 2004/111197 



PCT/US2004/018492 



19/81 



tissue inhibitor of metalloproteinase 1 (erythroid 
potentiating activity, collagenase inhibitor) 


ribosomal protein S13 (RPS13) I 


calpain 9 (nCL-4) (CAPN9) I 


NGFI-A binding protein 1 (ERG1 binding protein 1) 
(NAB1) 


potassium voltage-gated channel, Isk-related family, 
member 1 (KCNE1) 


matrix metalloproteinase 16 (membrane-inserted) 
(MMP16), transcript variant 2 


zinc finger protein, Y-linked (ZFY) | 


cytochrome P450, subfamily IIA (phenobarbital- 
inducible) 


platelet activating receptor homolog (H963) I 


solute carrier family 4, sodium bicarbonate 
cotransporter, member 7 (SLC4A7) 


scieliin (SCEL) I 


cytoskeletal type II keratin 


connective tissue growth factor 


LIM domain | 


LIM domain 


peptidyl-prolyl isomerase G (cyclophiiin G) 


mel transforming oncogene (derived from cell line 
NK14)- RAB8 homolog 


immunoglobulin superfamily. member 4 j 


melanoma antigen, family D ! 


H2B histone family, member S (H2BFS) 


putative transmembrane protein . (HS1-2) 


transcobalamin I (vitamin B12 binding protein, R 
binder family) (TCN1) 


diaphanous (Drosophila, homolog) 2 (DIAPH2), 
transcript variant 156 


matrix metalloproteinase 10 (stromelysin 2) 
(MMP10) 


claudin 10 (CLDN10) j 


Xp11.3-p11.23 


11p15 i 


co 

CN 

cr 

i 

T— 

T~ 

CN 
cr 


CO 
CO 

£ 

CN 
CO 


21q22.12 


CN 
cr 

00 


Yp11.3 | 


19q13.2 


3q26.1 | 


CN 
CO 


CM 
CN 

cr 
co 

T— 


CO 
CM 

| 


CO 

CM 

cr 
CO 


1p22.3 | 


1p22.3 | 


2q31.1 j 


CO 

CL 
CO 


CN 
CO 
CN 
CT 

T— 
T— 


CO 
CM 

v- 

Q. 

X 


CO 

CN 
CN 

cr 

v— 

CN 


CD 
CN 

cr 

LO 
r— 


CM 

• CT 
i 

T— 

cr 


Xq22 


11q22.3 


co 
cr 
i 

T— 

CO 

cr 

CO 


NM 003254 


o 

o 
o 

2 


919900 IAIN 


NM 005966 


NM_000219 


CD 
LO 
. CN 
CN 
O 

2 


NM_003411 | 


CN 
CO 
h- 
O 

o 
o 

2 


CO 

o 

CO 
CO 

1 

2 


LO 

T— 

CO 
CO 

o 
o 

2 


CO 

CO 
CO 

o 

3 

z 


J00269 1 


M92934 \ 


BC003600 ! 


AI824831 ! 


CN 
CO 

o 

i 


BC002977 


CO 
CO 

co 

1 

2 


AF217963 | 


LO 

1 

2 


O 

r- 
o 
o 

2 


NM 001062 


63Z900 IAIN 


LO 

CN 
N" 
CN 
O 

3 

2 


s 

CO 
CO 

o 
o 

z 


— - ■■ ■- ■ ■ ■ 

TIMP1 


RPS13 j 


CAPN9 | 


NAB1 


KCNE1 


MMP16 


ZFY i 


CYP2A6 


H963 ! 


SLC4A7 


SCEL 


KRT6A 1 


CTGF ! 


LM04 | 


LM04 i 


PPIG I 


MEL 


IGSF4 i 


MAGED1 


H2BFS ! 


ABHD2 


TCN1 


DIAPH2 


MMP10 


CLDN10 


201666 at 


20001 8 jt I 


co 
co 

CO 
CD 

o 

CO 

o 

? 


+-* 
CO 

o 

CO 

o 

CN 


"co 

LO 
00 

o 


to 

CO 
CO 

5 

o 

CN 


207246_at | 


*-* 

CN 

o 


LO 

o 

CN 


to 

CO 

o 

fS 

o 

CN 


CO 

to 

rt 
CO 
00 
CD 


+-* 

CO 

LO 
CM 

CO 

s 


to 
o 

T- 

O) 

o 

CN 


to 

CO 
LO 

o 

CM 
CO 
O 

? 


CO 

s 

CM 
CO 

s 


*' 

CO 
CO 
CO 
CO 

o 

p 


co 

CO 
T"" 
00 
00 

o 

CN 


™l 

x— 
CO 
O 
CO 

8 


CO 
N* 

T— 

o 

CO 

8 


to 

3 

CO 

LO 
CO 

o 

CM 


CO 

CO 
CO 
LO 
LO 

o 

a 


•4-1 

"I 
CO 

LO 
LO 

o 

B 


to 

CO 
CM 
h- 
LO 

o 

CM 


■*— » 

CO 

o 

CO 

CO 
LO 

o 

CN 


|#205328_at 1 



WO 2004/111197 



PCT/US2004/018492 



20/81 



amphiregulin (schwannoma-derived growth factor) 
(AREG) 


Kallmann syndrome 1 sequence (KAL1) | 


prostate stem cell antigen (PSCA) j 


glutamate decarboxylase 1 (brain, 67kD) (GAD1), 
! transcript variant GAD67 


src kinase-associated phosphoprotein of 55 kDa 
(SKAP55) 


CD2 antigen (p50), sheep red blood cell receptor 
(CD2) 


mammaglobin 2 (MGB2) | 


aldo-keto reductase family 1 | 


heparin-binding growth factor binding protein 
(HBP 17) 


dystrobrevin, alpha (DTNA) % | 


sulfotransferase family, cytosoiic, 2B, member 1 
(SULT2B1) 


homogentisate 1,2-dioxygenase (homogentisate 
oxidase) (HGD) 


smooth muscle LIM protein (h-SmLIM) | 


lymphocyte antigen 6 complex, locus D (E48) | 


RAN binding protein 17 (RANBP17) j 


4q13-q21 


Xp22.32 i 


8q24.2 I 


2q31 


CM 
CO 

CM 
cr 

T— 


1p13 


11q13 ! 


T— 

CO 

T- 

co 
cr 


LO 

Ql 
i 

CD 
Q. 


18q12 


19q13.3 


CO 
CM 

7 

T— 

CM 
cr 

CO 


CM 
cr 
CM 


£ 
cr 
t 

CM 

cr 

CO 


^- 

CO 

cr 
m 


NM 001657 


NMJJ00216 | 


CM 
h- 
CD 
LO 
O 

5 

z 


00 

o 
o 
o 

Z 


NM 003726 


I s * 

f— 

O 

9 

Z 


o 

a 

o 

i 

Z 


CO 
CO 
CM 
O 
CM 

J 

z 


NM_005130 


CM 
CO 
CO 

T— 

o 
o 

z 


NM 004605 


N- 
00 

o 
o 

3 

z 


U46006 


NM 003695 


NM 022897 


AREG 


KAL1 ! 


PSCA | 


GAD1 


SCAP1 


CD2 


SCGB2A1 | 


AKR1B10 I 


HBP17 


DTNA | 


SULT2B1 


HGD 


CSRP2 


E48 


RAN BP 17 


205239 at 


to 

CD 
O 
CM 
LO 
O 
CM 


|205319_at J 


3 

00 

& 

LO 
O 


5 

CO 
I s * 
LO 

o 

B 


CO 

CO 
00 
LO 

o 

CM 

< 


-»— » 

5 

CO 

& 

LO 
O 
CM 


T— 

CD 
LO 
CD 
O 
CM 


■4— » 

CO 

^* 
o 

LO 

o 

CM 


-4— < 

°l 
CO 

S 

LO 

o 

CM 


ro 
w 

CO 
LO 
h- 
LO 
O 


ro . 

CM 
CM 
LO 
O 

8 


|211126_s_at 


|206276_at 


|#219661_at 



WO 2004/111197 



PCT7US2004/018492 



21/81 



















tein • ! 


[100 kDa) 


Factor 






I CCAAT/enhancer binding protein (C/EBP). beta I 




nsferase 




































beta 




Description 




hypothetical protein FLJ10936 


lalanyl (membrane) aminopeptidase 


I ribosomal protein S20 


I ribpsomai protein L4 


MAA0692 protein 


I ESTs, Weakly similar to neuronal thread Droi 


I thyroid hormone receptor-associated Drotein I 


I prostate epithelium-specific Ets transcription i 


[hypothetical protein BC001096 


I ribosomal protein L17 


IKIAA0052 protein 


O-linked N-acetylglucosamine (GlcNAc) trai 
(UDP-N-acetylglucosamine:polypeptide-N- 
acetylglucosaminyl transferase) 


Unknown 


'rhodanese 


< 

z 
z 
E 

< 

< 
c 

3) 

5 c 

i 8 

5 2 
v o 
z sz 


Similar to hypothetical protein 


similar to rat myomeqalin 




hqp0376 protein helicase-reiated protein 


metallothionein 1H-like protein 


KIAA0705 atrophin-1 interacting protein 1 


neurogenic extracellular slit protein 


hypothetical protein FLJ22569 


hypothetical protein DKFZp761F2014 


hypothetical protein FLJ23384 


pp21 homolog (LOC51186) 


hypothetical protein FLJ22471 


Kruppel-like factor 4 (gut) 


capping protein (actin filament) muscle Z-line, 




|Map 




|12q13.3 


|15q25-q26 


CM 

cr 
oo 


1 1 5q22 


CM 


CM 


|17q12 


CO 

T— 

CM 
CL 
CD 


|4p15.1 | 


]18q21 | 


CO 

cr 
o 

CM 


5q11 | 




10q23.31 | 


22q1.3.1 


9q31 


T— 

CM 

CD 
CO 

CL 

i 

CO 
00 

CD 
CO 
CL 


1q12 | 


17q22 




CM 

cr 


CM 
in 

CL 


12p11 | 


14q32.2 | 


CO 

CD 
CO 
CL 

T— 


Xq22.1 


12q24.31 \ 


9q31 | 


T— 

CD 
CO 
CL 

T— 




| Genbank 


IAC004832 


CD 
N- 
CN 
00 

5 i 
z 


09I400 IAIN I 


IAL576654 . | 


|BC005817 


AB014592 


IBG391282 | 


IAI023317 | 


|AI745526 | 


W87466 I 


BG1 68283 j 


|AL564683 | 


D29641 | 


AF070560 


AF131820 | 


ID87292 


AF313911 


BC001709 ] 


AB042557 I 


AF078844 


AF333388 ] 


AB014605 I 


AF055585 ! 


w 

CM 
CD 
CO 
CM 

o 
Z 


in 

T" 

CM 

o 

CM 
°l 

Z 


00 

in 

CM 


NM 016303 


o 
^~ 

m 

CM 

o 


BF514079 | 


U03271 i 




Common 




FLJ 10936 


ANPEP 


o 

CM 
CO 
D_ 

a: 


RPL4 


KIAA0692 




TRAP 100 


LL 
LU 
□ 
CL 


LOC92689 


RPL17 j 


DO 
CL 
CD. 
LU 

o 


PPAP2A 




GHITM 


f- F 

PS 


TXN 


FU13052 | 


PDE4DIP j 


RNAHP; RHELP; 
SF3M25 




< 


CM 
- 

Ij 
7) 


5 

UJ 

III 


T— 

o 

CM 
LL 

T— 

CO 
CL 

N 

JL 


AG MAT | 


LOC51186 


FLJ22471 | 


£ 


CAPZB | 


|53003_analysis 


bystematic 


ro 

CO 

JD 
r— 

rM 


4— ' 

CO 

cn 

03 
i- 


*-» 

CO 

w 

30 

X) 

DO 

*M 

3 « 

"M 


CO 

=> 
n 

rsl 
>J 

c— 

rsj 


X 

r- 
N - 
r- 
r~ 


+-» 

CO 

r- ( 
O ( 

rsl ' 
rsl < 

N < 


ro 

.£> 

"M 
r- 
N 
c— 
N 


ro 

ro 

o 

ro 

rg : 


ro 

K 


CO 

in 

LO 

ro 

r- 

cm 


3 

rM 
:m 

c— 

r\» < 


I— 

m 

r— 

:m 


S l 

JO 
J) 
30 
N 
r— 
N 


5 

O ' 

( 

CM < 
CO ( 
O ( 
CM : 


ra i 

to 
j> 
N < 
o 


" CO 

ro 
.n 

15 00 

3) in 

3 00 

as 


ro 

CD ' 
00 < 
00 I 
O i 
CM I 


ro 

CO 

35 
e- 

J> 
DO 
=) 
■M ( 


^0 
=> 
c— 

N 


3 

CM ( 

in i 
o ■ 

51 


4— « 

5 

D I 
f> < 
<3* I 
r- < 
r- < 

N < 


5, 

"0 < 
^- ( 
3) < 
D < 
N ( 


CO 

^ « 

J) 1 
30 ' 
3> < 
O * 
"M < 


(0 

S ! 

Si 

Mi 


j. 

M 

?: 


S . 

N 1 
35 1 
^ ( 
7i 1 
— * 
M ( 


1 

zr> ' 
— i 

rM ! 


3 

r- 

DO 
r- 
-M 


ro 
W | 

DO 

rM 
rM 


4— > 

CO 
N 

5 



WO 2004/111197 



PCT/US2004/018492 



22/81 



LL 
O 



2 

O 



o 

CO 



CO 

CD 
LO 
CL 
N 
LL 

Q 
c 



a. 

•c 
o 

CO 

c 
(0 



(0 

LU 
o 

CL 



CO 



X 

o 

X3 
I 

CD 



<u 

• £ 

"5. 
a) 
o 

to 
p 



— I— 

CO -g 

^ 2 

CD CL 

c c 
co 2 

€~ 
x £ 

® m 

IS o 

II 

3 CO . 

Is 

to 

£ o o 
o 



li 

a E 

X CO 



CO 

CO 
CO 

o 

to" 
CO 



2,2 

I— 

JO ^ 

i c * 

« o 



C O .CO 



6 ^ 



C CD 05 

co o g 

PI g 

co -5 



C O 
CD F 

co<S 



CD 
JQ 

£ 

CD 

E 

#«— > 

CL 

t 

o 

u 



c 

C 

la 
i 

CL 

< CD 



LL ±s 



CD 



£ E 



-D fa 

3 8 

(u a) 

II 

c a) 
CJ o_ 

3 >> 

$8. 



CD 

II 

0 ^ 

CL CO 

CD O 

C - 

E 

S © 

jo a) 
co « 

2! 

1 it 1 

■s-'e 

<p CO 
-J- co 
o 

a> o 



CD 

o 

Q) 
TO 

I 

c 

CD 
CO 
Lj 

c 

CO 

o 



CO 
O LD 



LU 



CD 

•a 
c 

CD 
CL 
CD 
TJ 
i 

CL 

9 



CO 

E 



CL 

DQ 
GQ 
LU 

C 

S 

e 

CL 
X 

o 

_D 
00 

CD 

CO 

c 
o 

Q- 

vy 

£ 

C 
CD 
CO 

s 

w 
0 



c 

s 

2 

CL 
O 

CO 

■4— » 

to 
ro 

-O 
O 

jr 

CL 
O 



.CD 



C 

o 

? 

in 



CM 



CL 



CM 



CM 

cr 
i 

CO 
CN 

CM 

cr 



a> 
in 

5 

CN 



CD 
CO 
O 
CD 



CO 
00 
CM 

CO 

. o, 



LO 

o 

CO 
CO 

o, 



CN 
CO 

7 

00 
CM 
cr 



£5 

O 

o 

< 



CM 
CO CM 

cm cr 
cr 

CO CM 



CM 
O 
CD 

CO 

o 



LO 
CD 

T— 

to 
O 
O 



CM 
CD 



CD 

CD 
LO 



to 

CO 



CM 
CO 
00 

CM 



CO 

CO 

CD 
CO 

CN 



ro, 



CM 



. co 

X 

CD 
CO 
CO 
CD 

CM 



CL 

CD 



CM 
CO 

cr 



Is 



I— H 



co 

CO 
LO 

CM 



CM 
CL 
CM 



CM 



CM 



cr 

00 



O) 



CM 



CM 



cr 
CO 



LO 
CD 
CO 
CM 

o 

CD 



CD 



LO 

7 

XT 

cr 
CD 



O 

r^. 

CD 
CD 
O 

o 



CD 



LU 



TO, 



CD 
LO 

§ 

CM 



WO 2004/111197 



PCT7US2004/018492 



23/81 



LU 

ct: 



CD 

Xf 

CL 
CD 
CL 

-a 

■g 
'u 
co 



< 

Q- 

05 



S 

CL 



S 1 

CL 

a: 



a) 

CO 
CD 

o 

2 



o 

I— 

a. 

To 

£ 
o 

CO 

o 

JQ 



E 

CO 

% 

CD 

c 
c 



CD 
o 

CD 
CL 
CO 
i 

00 
0 



111 

LU 

or 



c 

CO 
CL 
CO 



s 

c 

cd 

e 

CL 
CO 

E 
o 
co 
o 

JO 



a> 

CO 

5 
o 

CO 

to 

CO 



o 

CD 
CO 

CD 

cr 

X 

E 

CD 

E 

ct 
E 



o 

CL 
£0 
C 

+ 
X 

CD" 

to 

CO 



fc! cp 

J* 2 
o c 

si 

CO O 

X >> cl 



O 



CD 
CO 
CO 

s 

GL 
X 
CD 



CD 
Q. 
Cl 
Z3 



'§ .5 
o E 

CL JSf 

a) a? 
c?o 

o o 
c o 

N © 

W ~ 
C 
CD 
CL 
CO 
CO .1= 



CD 
•»-» 

2 

CL 

CD 
C 

X3 
C 

X 

CN 

s 

C 

E 
iS 



E z 

jo O 
co ' 
x 
o 
o 



o 

I'D 

Q. 
CD 
CL 

° 

CL 



a 

CO 

o" 

o 
c* 



— . _2 



CL 
CD 
CJ 
CD 

L. 

u 

"co ^ 

H 

CO 

* i 



IS 

la 

CO 

o 
c 

CD 
-C 
CL 



E 

CD 

co S 

O CD 

m q. 
>* 

CL O 

o * 
p -® 



CD 

o 

"co 
c 



a) 
jo 

CO 
CD 

c 

§ 

■a 
c 

CD 

e 

a. 



s 11 



C 

CD 

g 

XT 

till 



CO 
CL 
CD 



O 
La 
co 

2 



CD 
co 
iS 

f 

XI 

i— 

CO 

o 

CD 
"O 

a) 

CO 

E 

CO 



c 

S3 



© 
"O 

c 

co 

Xj 
c 

0) 

e 

CL 
CD 

c 
5 

X 



o 
o 



CD 



CN 

CD 
CO 
CL 



to 

CN 
CL 
CO 



CN 
CN 
cr 
co 



CO 
CN 
CT 
ID 



CN 
CT 
CN 



CO 
CO 



CM 

cr 



CD 
CO 



CD 
CO 

co 
co 

2 



CO 



O 

CO 



CN 
Q. 
CD 
CL 



CO 
CM 
CT 



CN 

T— 
7 



CO, 



CN 



CN 

CO 

o 
o 
o 

CM 



CO 

K 

CO 

o 
o 

CN 



CM 



CO 

m 
o 
o 
o 

CM 



CO 

CO 
CM 



N 









5 






CO 


CD 




CD 


in 


in 


in 


CN 


in 


in 


N- 


o 


o 


m 






o 




3 


CM 



CN 



o 
o 
o 
o, 



^ < 



CM 



CD 



CM 
CD 

O 
O 
O 



CD 

I 

O 



CM 
CL 



CO 

?u 

CM 
c- cr 
D" CO 

in 



CM 
LO 

tb 
^- m 
o o 
o Jo, 

2 2 



CM 
CO 

_ CM 

oHcn 

CL 

X 



8 11 



"l 






CO 






CO 








CM 


!? 




in 


o 






O 

CM 



a a 

h- It— 
O CD 

co kr 
cp Icq 
o o 

CM CN 



rO 
cr 
CM 



CO 




o 


x— 




CO 


CM 


00 


CO 


O 


o 


CO 


o 


o 


o 


o 




o 




2 






2 


2 



Q 

< 



CM 
CL 



CO 
CM 

in 
o 



WO 2004/111197 



PCT/US2004/018492 



24/81 





















! phosphodiesterase 4A, cAMP-specific 


claudin 10 (CLDN10) 


IKIAA0626 gene product 


! aldo-keto reductase family 1, member B10 


isushi-repeat protein (SRPUL) 


RAN binding protein- 17 (RANBP17) 


FLJ12155 


homogentisate 1 ,2-dioxygenase 


sulfotransferase family, cytosolic, 2B t member 


19p13.2 | 


|13q31-q34 | 


OO 

cm 

CO 

cr 


|7q31.31 | 


|Xq21.33-q23 


|5q34 I 


|19p13.2 i 


CO 
CM 

7 

v- 
CM 
cr 
CO 


19q13.3 


006202 


J306984 | 


.021647 i 


.020299 [ 


.014467 | 


.022897 | 


22217 | 


.000187 I 


004605 


2 


■ZL 




NM. 


NM. 


"IAIN I 


AKO 


"IAIN 




PDE4A 


CLDN10 ! 


KIAA0626 J 


AKR1B10 


SRPUL | 


RANBP17 l 


CM 
2 
DC 
h- 


Q 
CD 
X 


ISULT2B1 ] 


|204735_at 


+— « 

CO 

oo 

CN 

oo 

LO 

o 


TO 

CM 

LO 

o 

CM 


-*— • 

i 

CD 
LO 
CD 
O 

3 


To 
cd 

CD 
LO 

o 

CM 


#219661_at | 


3 

oo 

LO 
CM 


#205221_at [ 


to 

CD 
LO 

LO 

o 



WO 2004/111197 



25/81 



PCT/US2004/018492 



Description 


surfactant, 
pulmonary- 
associated 
protein C 
(SFTPC) 


hypothetical 
protein FU21511 
(FU21511) 




uteroglobin 
(UGB) 


DKFZp566M043 


cylicin, basic 
protein of sperm 
head 

cytoskeleton 1 


protein kinase C, 
delta binding 
protein 


Chromos 
ome 


CO 


-a- 


lO 


v— 


CM 


o 




Map 


% 

CO 


4p12 


LO 
CM 

I 

to 

V 


CO 

T— 

cr 

co 

CM 

a 

T— 


cr 

T— 

CM 


CM 


to' 
a 


Genbank 


00 

o 

CO 

o 

2 


h- 

00 

o 

LO 
CM 

°l 

■z 


AL109706 I 


to 

CO 
CO 

o 

°l 

2 


AL050065 i 


Z22780 


AI088622 


Common 

■ 


SFTPC 


FLJ21511 




SCGB1A1 




CYLC1 


PRKCDBP 


Expression in 
Smoker with cancer 
compared to 
smoker 


Down 


Down 


Down ! 


Down 


Down 


Down 


Down 


Fold change 
S-SC 


0.0195073 


0.1292919 


CO 
CO 

00 
CM 

d 


0.34256414 


0.36099912 | 


0.36278119 
— — j 


0.37048295 | 


Expression in 
Smoker with 
cancer 
compared to 
Non smoker 


Down 


Down 


Down ! 


UP 


Down 


Down 


Down 


Fold change 
NS-SC 


0.025522773 


0.69016811 


0.408836024 | 


2.595974945 


0.229272568 | 


0.4400818 


0.595397727 


Expression in 
Smoker 
compared to 
Non smoker 

i 


UP 


UP 


Q. 


UP 


Down 


UP 


UP 


Fold change 
NS-S 


1.308370044 


5.338061466 


CM 
CD 
CO 
CO 

ES 
o 

9. 


7.578069671 


CO 

o 

CD 

ir> 
O 

lf> 
CO 
CD 

d 


1.21307779 . 


1.607085348 



3 

d 
E 



WO 2004/111197 



26/81 



PCT/US2004/018492 



putative gene with 
similarity to zinc 
finger proteins; 
Homo sapiens 
chromosome 21 
segment 


aldo-keto 
reductase family 
1. member R10 


similar to rat 
mvomeaaiin 


3 

O 
c 

s— + 

„ Q5 
5 05 

.E 2 


t 

alanyl 

(membrane) 

aminopeptidase 

(aminopeptidase 

aminopeptidase 
M, microsomal 
aminopeptidase, 
CD13, p150) 
(ANPEP} 


hypothetical 
protein FLJ12567 


hypothetical 
protein FU12975 


CM 






r* 


in 


o 


CM 


T— 

x— 

cm 


7q31.31 


CM 
r- 

cr 


CO 
i 

cr 

T— 


CO 
CM 

$ 

CM 
CP 
ID 
v- 




12q24.31 


AL163202 


o> 

05 
Csl 
O 

s 

2 


AB042557 


LO 
v~ 
CO 
CO 

o 

°l 

Z 


O 
LO 
r— 
t— 
O 

3 

2 


i 

2 


05 

o 

CO 

s 

3 . 

2 


PRED65 


AKR1B10 ■' 


PDE4DIP 


CAPN9 


ANPEP 




FU12975 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


0.39196371 


0.42944426 


0.4404226 


0.44071146 


0.45368281 


0.45504678 


0.46426629 


UP 


Down 


UP 


Down 


UP 


Down 


Down 


1.156448477 


0.029074395 


1.279375287 


0.464119455 


1.383543464 


0.696038224 


0.762232351 


UP 


Down 


a. 


2 


CL 
3 


D 


a. 


2.950396825 


•O 

CD < 
=> 


5 

r- 

DO 

s 

*M 


i— 

r— 

t— 
r- 

"O 


«— 

to 

r- 

X) < 

2 

a 

^0 


ID « 
O < 
^- < 
35 ( 

£ ! 


s 

35 
r* 

S 



WO 2004/111197 



27/81 



PCT/US2004/018492 



H-RYK receptor 
tyrosine kinase* 


KIAA0740 protein 


glycoprotein M6B 


hypothetical 
protein 

DKFZD761F2014 


hypothetical 
protein FU23191 


adenylate kinase 


mitotic spindle 
colled-coil related 
protein 


r~ 

renal tumor 


embryonic 
myosin alkali 
light chain 


RAB5C, member 
RAS oncogene 

family 


1 glutamate 


RecQ protein-like 


CO 


o 


o 






ro 










CM 




CM 
CM 

cr 

CO 


10q22.1 


CM 

oi 

CM 
CL 

X 


CM 

oi 

CO 

T— 


OI 


9q34.1 


cr ■ 


14q32 


CD 

? 

T— 

OI 

cr 


OJ 

CM 
cr 


T— 

CO 

CT 
CM 


CO 

in 

CM 

cr 

OI 

in 

£ ! 


X96588 


AB018283 


AF016004 


lO 

1 

s 

z 


3 

z 


BC001116 


CO 

CD 
O 

z 


CD 

9 

z 


M36172 


CO 
00 

m 

3 

z 


r» 

00 

o 
o 

5 

z 


AB006533 


RYK 


RHOBTB1 


GPM6B 


DKFZp761F2014 


FLJ23191 


AK1 


SPAG5 


RAGE 


MYL4 

* 


RAB5C 


GAD1 


RECQL5 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


0.47391058 


0.47477663 


0.4999007 


0.51802131 


0.52824722 


0.52962462 


0.53922439 


0.56149843 


0,56479139 


0.56611815 


0.56856846 


0.56930769 


Down 


Down 


UP 


UP 


Down 


UP 


Down 


Down 


Down 


UP 


Down 


o 
z 


0.691341256 


0.761512027 


1.446077458 


1.269667568 


in 

CO 

o 

3> 
N 
CO 
D 


1.121950712 


0.902488906 


0.82241624 


0.459165545 

i 


CO 
00 

CD 
LO 
CO 
CO 
CO 
CM 
r- 


0.115767635 


CM 
o> 

CO 

I s * i 
o 

CO i 

oo 

ro 

co 

d 


UP 


a. 

D 


0. 


CL 
ID 


3l ( 


DL 
D 


a. 

d : 


a. 

D { 


uown 


3- 


uown 


n_ 

D 


1.458801051 


to 

^ 

ro 
o> 
ro 

CO 
£> 
r-* 


I 

CM 

CM 
□) 
30 
CM 


CM < 
<3* 

OO 1 
V < 
ro 

35 1 

O i 
LO 1 

* » 

cm h 


*> 

O 1 

ro 

• — . 
n 

r- 1 < 


•o 

n < 

30 < 
30 1 
T> 1 

5 

i- 1 


3> ( 
X> ( 

s : 

r— ■< 


r- < 
30 1 
N 1 
r- ( 
30 I 

O i 
<r < 

D 

<r c 

c" ( 


D < 

n 

n < 

N C 

5 1 

N t 
I- c 
JO i 
=> c 


ro < 

^ i 

3 ; 

n ^ 
* < 

a c 
i- ( 
vi c 


35 l 

^ i 

( 

M i 
f- I 
D 

O < 

N < 
6 i 


n 
n 

D 

n 
n 

So 
<r 

D 

— 



WO 2004/111197 PCT/US2004/018492 

28/81 



thyroid hormone 
receptor 
interactor 13 
(TRIP13) 


KIAA0420 
nuclear 
transcription 
factor, X-box 
binding 1 (NFX1) 


myosin XVB, 


period homolog 1 


testis specific, 10 


tripartite motif- 
containing 3 


in 


CO 

CD 


T— 




CM 




<N < 

Q. ( 

in 


EO 

f T— 

r> 

r~ ^ 
Q. CO 


CO 

iri 

CM 


CM 

Q. 
h- 

CO 
D. 

T— 


2q11.2 


in 
in 
a 

T— 


CO c 

S f 

z « 


3 3 

□ 2 
* z 


AI825877 


AF022991 


in 

CM 

°l 
Z 


AA1 14843 


51 5 

5 5 
h- * 


ODF2 


MY015B 


PER1 


TSGA10 


TRIM3 


Down 


Down 


(Down 


Down 


Down 


Down 


0.57930229 


0.58126673 


0.58574764 


0.59662189 


0.60295459 


0.60651085 


o I 

Z Q 


Down 


ON 


O 

z 


Down 


Down 


1.038833852 
0.668732537 


0.873933988 


to 

in 

■n 
r- 


m 

CO 

o 
<t 

N 

D < 

^ ( 
35 ( 
3 < 


tn < 
s. 

o < 

33 1 
30 < 
■M < 

n < 
=> < 

30 f 
D < 


n 

3> 

j> 
o 

D 


CL Q_ 


UP 


1- c 

d : 


L C 

d : 


L C 

d : 


L 
D 


1.793250027 
1.15064901 


1503499079 


I c 

C 

o 
o 

d- i 
o c 
0 c 

^ c 

— T 


D C 
3* c 

3 C 

» f 
O u 

O U 

3 ? 


O 

O 0 

D C 

— T 

8 * 

2 a 
o r 


0 
0 

3 

& 

3 
si 



WO 2004/111197 



29/81 



PCT/DS2004/018492 



s 

o c 



9- 

CO 



•i= o 



So 

•5 1 



W 
CD 
C 

= f 

<D w 

E 



(0 ^ — 



F < 



33 

o "G 

Q. CO 



c 

» .2 < 

"PIS 



a: ^ 

IS 

UJ UJ 
LL CD 



_ O 

II l_ 

Q 2 

O s 

a. c 



CO 

c 
a> 

S o 

1 1 



o 
m 

'a> 
ii 

LL 
LU 

tr 

5, 



w 



. 00 

m o 
Q 10 



rifles? 



x 

ii 

CD w Q 

5 5 E 



CO 

co T 
o o 

O CO 
O CNJ 

LL i- 



CO 
O 
CO 
00 

co 
o 

ID 



c 

a 



CO 
CO 
CO 
CO 

LO 
CO 
CO 

d 



CM 
CO 

o 
CO 1 

2-E 
o o 

Ss 

x: a. 



r- 



WO 2004/111197 PCT/US2004/018492 

30/81 



S100 calcium- 
binding protein 
A10 (annexin II 
ligand, calpactin 
I. light 
polypeptide 

A A \\ 

(011 11 


CQ 
c 

=11 


KIAA0683 gene 


hypothetical 


testis specific 
leucine rich 
repeat protein 


v\* ■ / 

hypothetical 
protein FLJ10482 


hypothetical 18.1 
kDa protein 
(CDKN2A) 


putative tumor 
suppressor 


hypothetical 

protein 

MGC35048 




CM 


CO 


O 


00 


^* 

T- 


05 


CO 


CO 

V 


t— 

CM 


1 

CO 

SL 

CM 


16p13.12 


10q24.32 


CM 
CM 

cr 
oo 


14q11.1 


2. 

CD 


3p21.3 


CO 
CO 

T— 

a. 

CD 


996200 IAIN 


Y15718 


AB014583 


B 

CM 

S 

Z 


v— 

S 

z 


NM_018107 


U38945 


U23946 


AW070229 


S100A10 


DTNB 


KIAA0683 


PP5395 


TSLRP 


FLJ10482 


CDKN2A 


RBM5 


MGC35048 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


0.61112972 


0.61484746 


0.63095389 


0.64428835 


0.65160307 


0.6529297 


0.66876791 


0.67032104 


0.67470784 


Down 


Down 


ON 


Down 


Down 


Down 


Down 


ON 


Down 


0.328592534 


CD 
CM 
P-- 

co 

St 

CO 
CO 
M- 

o' 


0.945041061 


CD 
CM 
CO 

m 

s 

o 

CO 
CD 


CO 
CO 
CM 

5" 

CD 
CO 

so 


CO 

CO 
r- 
LO 
M* 

SO 


LO 

o 

CO 

2 
8 

d 


LO 
CD 
DO 
CD 
* 


r*- 

M 
o 

CO 

^ j 

CO 

a 

30 
=1 


Down 


uown 


a. i 

r> : 


X ( 

d : 


3- f 

d : 


J- 

D t 


uown 


L C 

d : 


1- 

D 


0.537680505 


a 

7) 
3> 

? 

o 
o 

So * 

3 i 


^» i 
o 

O < 
^ ( 
J> t 

^ l 

5 ! 
— ^ 


r- ( 
r- ( 

3 I 

i < 

* c 


90 i 

O 

D 

O C 

? 
D 

3 


t i 

2 J 

i < 

M I 
- C 


L 

O * 
*- c 

2 L 

2. 5 

jT C 
O L 

D T 


o 
a- 

O C 
D i 

2 f 

O L 


O 

t 

5 

D 



WO 2004/111197 



31/81 



PCTYUS2004/018492 

i 




WO 2004/111197- 



32/81 



LIS1-interacting j 
protein NUDEL; f 
endooligopeptidas - 
e A (NUDEL) I 


FLJ23099 fis, 
highly similar to 
AF1 13251 
putative 2- 
hydroxyacid 
dehydrogenase 
mRNA 


DKFZp761H087 
hypothetical 
protein p53 
inducible oroteln 


aldehyde 

dehydrogenase 2, 

mitbchondriai 

(ALDH2) 


t. 

CO 

£ 

Q c 

€ s 

o o 

C D 
- □) 03 


lysosomal 
glucocerebrosidas 


HMG-box 
transcription 
factor TCF-3 1 


deubiquitinating 5 
enzyme UnpES J 


T— 


o> 




CM 


CO 




CM 


CO 


CM 
r- 


9q12 


5q34 


CM 
CM 

8 •. 


13q34 


T~ 

CM 
O" 


2p11.2 


CO 

CM 

CL 
CO 


9090CO IAIN 


AK026752 


AL161999 


069000 IAIN 


L13720 


K02920 


CO 
00 
CM 
r— 
CO 

3 

z 


AF017306 


NUDEL 


GRHPR; GLXR 


CYFIP2 


ALDH2 


GAS6 


GBA 


CO 
Li 
O 


USP4 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


0.74223943 


0.7436369 


0.74440572 


0.74512283 


0.7521483 


0.75779503 


0.77284852 


0.7771127 


Down 


ON 


UP 


NC 


UP 


ON 


UP 


ON 


0.703663142 


0.988549392 


2.059951203 


1.051352347 


1.263930929 


0.974555979 


1.276821564 


0.912353309 


ON 


0- 


UP 




tt. 


EL 
D 




D 


0.948027169 


<fr 
co 
r- 
M" 

ro 
^ 


in 

P 
I s - 

rsi 


oo 
m 

CO 
I— 


in 

CM 
CO 

M* 

O 
93 
X) 
«— 


N. 
O 
© 
r- 
M" 

© 
30 
N 


CO 1 
33 < 
3 i 

2S ( 

3 ( 
M 

m i 

JD 

r~ ' 


n 
35 
n 

33 
N 
ID 



PCT7US2004/018492 



WO 2004/111197 



33/81 



PCT/US2004/018492 



sperm associated 
antigen 7 
(SPAG7) 


claudin 10 
(CLDN10) 


KIAA0052 protein 


heterogeneous 
nuclear 

ribonucleoprotein 
D-like (HNRPDL) 


putative 
transmembrane 
Drotein (HS1-2) 


Kallmann 
syndrome 1 
seauence (XAL1 


FK506 binding 
protein precursor 
(LOC51303) 


O-linked N- 

acetylglucosamine 

(GlcNAc) 

transferase (UDP- 
N- 

acetylglucosamine 
:polypeptide-N- 
acetylglucosaminy 
I transferase) 




CO 


LO 




LO 

T— 


o 


CM 
r— 


o 


17p13.3 


s 

7 

CO 

cr 

CO 
r— 


T- 
T— 

cr 

LO 


CM 

cr 

ch 


s 
s 

r- 


CO 

i 

X 


12p11.1 




o 

CO 

i 


3 

05 
CO 

o 

5 

z 


D29641 


co 

CO 

s 

o 

3 

21 


O 

J 

Z 


CO 

§ 

o 

i 

z 


1 
z 


AF070560 


SPAG7 


CLDN10 


PPAP2A 


HNRPDL 


ABHD2 


KAL1 


FKBP11 




Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


0.77712803 


0.78659224 


0.79480585 


0.80201364 


0.8053316 


0.83837854 


0.84170931 


0.84936698 


UP 


Down 


NC 


ON 


Down 


UP 


Down 


UP 


1.167508233 


0.221632397 


1.063035804 


0.98152846 


0.473428695 


2.001653726 


0.416042041 


1.218311394 


UP 


Down 


UP 


UP 


Down 


UP 


Down 


0- 
D 


1.502337049 


0.281762755 


CO 
CO 
LO 
CO 

h- 

CO 
CO 

T— 


? 
5 

CO 
OO 
CO 

s 


CO 

5! 
o 
oo 

D 
30 

CO 

LO 

d 


oo 

CO 

05 
CM 
O 

30 
CO 
CM 


00 

i 

d 


ro 



WO 2004/111197 



34/81 



PCTYUS2004/018492 



S100 calcium- 
binding protein P 
(S100P) 


t 

CO 

X *- 

CD CD O) 
W C 2 

of E 

•£ -S o 


steroid sulfatase 
(microsomal), 
arylsulfatase C, 


Similar to 
hypothetical 
protein FLJ13052 


RAN binding 
protein 17 


homogentisate 
1 ,2-dioxygenase 
(homogentisate 


2,3-cyclic 
nucleotide 3 
phosphodiesteras 


hairyenhancer-of- 
split related with 
YRPW motif 1 
(HEY1) 




CM CD 
CM t— 


o 


T— 




CO 


I 


00 


CO 

CL 
"3- 


CO CO 
r- t— 

cr cr 

CM CD 
CM t- 


CM 
CO 
CM 
CM 
Q. 
X 


1p36.33-p36.21 


5q34 


CO 
CM 

cr 
i 

CM 

cr 

CO 


1 


8q21 


O 

! 

1 \ 

2 I 


U84487 


AU138166 


BC001709 


CO 
CO 

1 

2 


CO 

o 
o 

5 

2 


BC001362 


CO 
ID 

SI 

T— i 

5 

2 


S100P 


CX3CL1 


STS 


FLJ13052 


RANBP17 


HGD 


CNP 


HEY1 


Down 


Up 


Up 


Up 


Up 


jUp 


Up 


Q. 

r> 


0.8924865 


1.27967337 


1.28769231 


1.30737751 


[1.33002364 


1.38337079 


00 
ID 
CM 

S 


1.40957199 


Down 
UP 


UP 


uown 


Down 


l. 


uown 


-> i 

z : 


D 


0.473970046 
2.042740959 


2.063668342 


s 

s : 

5 \ 

5 c 


■O ( 

» i 

s 
s 


ST < 

5 1 
2 r 

J : 
s 5 

^ C 
- C 


2 c 

3 C 

° £ 
— h 

M C 
D It 


o ' ! 
8 ! 

2 r 
o c 
*» •» 

D C 
C 


"M 
30 
O 
D 
r- 
v. 
3 

D 

si 


O CL 

q r> 


UP 


i 

^ c 


I a 


1 

u i 
:> c 


! J 


5 D 

5 : 


L 


0.53106691 
1.598358991 


1.612652412 


o § 
5 5 

? 

5 C 


2 r 
o ^ 

2 

5 

; 

3 t- 


4 T 

5 

5 

; 

c 


; r 

5 • itr 

I S 

5 |C 


1 
• 

: 

> 


I 
3 



WO 2004/111197 



35/81 



PCT/US2004/018492 



valosin-containing 
protein 


BX1 

CDC5 cell division 
cycle 5-like (S. 

nnmha^ 


r w 

sushi-repeat 

nrntoin /CDDI II \ 


histone 

deacetylase 2 
mnAro\ 


PFTAIRE protein 
kinase 1 


RNA binding motif 
protein 8A 


LIM domain only 4 


similar to thyroid 
hormone receptor- 
associated 
protein complex 
component 
TRAP240 


RAD23 homolog B 
(S. cerevlsiae) 


o> 


^ CD 


O 


CD 






CN r- 


CM 

T* 


O) 


CN/ 

5 

CL 
CD 


4 

6p21 


CO 
CN 

X 

CO 
t— 
CN 


6q21 


CN 
CN 

7 

V" 

CN 

t? 


1q12 


t- CO 

sa 

Q. CL 
CNJ r- 


CN 

si 


CN 
CO 

cr 

CO 


W60953 


AW268817 


CO 

1 

z 


CN 

m 
o 

j 

Z 


in 

O) 
CO 
CN 

o ( 

z 


AI738479 


AI824831 


AK023837 


AL527365 


VCP 


CDC5L 


SRPUL 


HDAC2 


PFTK1 


RBM8A 


LM04 


KIAA1025 


RAD23B 


Up 


Up 


Up 


Up 


Up 


Up 

1 In 


Up 


Up 


Up 


1.47527728 


1.47925296 


1.49857215 


1.50604822 


1.51373117 


1.56967083 


1.62667893 


1.64637135 


1.6557181 


UP 
UP 


UP 


Down 


z 


CL 


CL u 
CD Z 


UP 


CL 


D 

z 


1.180536673 
1.403407566 


1.356213018 


CN 
<3* 

ro < 

30 < 

■o 

n ( 
^ < 


N 
X) 
N 
n 

O < 
< 

s ; 

3 ( 
r~ ^ 


5 

x> 
<r 

N < 
7i i 
( 

n < 


1.076956056 


3.07175713 


N 

? ! 

i i 


r* 

n 

r* 
35 
O 
D 


i o 

Q Z 


O 

Z t 


uown 


D C 

d : 


L 

0 £ 


uown 
Down 


UP 


§ 

0 < 

1 c 




0.800213416 
0.949851543 


0.916822921 


1 s 

a- u 

D T 

n i* 

D CI 
O CJ 

I> c 


J> c 
t t 

% I 

0 C 

- c 

- u 

1 ? 

3 -r 


O b 
— x 

D T 

» C 

d r 
2 c 

0 0 

c 


U./UOU/U 1 13 

0.671029785 


1.888361045 


8 S 

O t 

5 5 
^ & 

M ^ 

CI 

3 C 


r 
■) 
r 

•> 
r 



WO 2004/111197 



36/81 



PCTAJS2004/018492 



phosphatase 2, 
regulatory subunit 
B (B56), gamma 
isoform 
(PPP2R5C) 


ubiquitin specific 
protease 14 
(tRNA-guanine 
transglycosyiase) 


heterogeneous 
nuclear 

ribonucleoprotein 
R 


glutamyl-prolyl- 
tRNA synthetase* 


G protein-coupled 
receDtor 


ribosomal protein 
S20 


capping protein 
(actin filament) 
muscle Z-line, 
beta 


ribosomal protein 


cyclic AMP 

nhnsnhnnrntoin 


ESTs 


CO 


CO 
T- 




T— 


CM 


CO 




CO 


LO 


CO 


3p21 


CM 

CO 

T- 

r— 
CL 

CO 


1p36.11 


! 

cr 


2q37.3 


CM 

T- 

o* 

CO 


T** 

CD 
CO 
CL 


I— 

CD 

1 

CM 
cr 

CO 


15q15.3 


CO 


NM_002719 


BC003556 


BC001449 


AI475965 


AI817041 


AL576654 


U03271 


AW517686 


BC003418 


BF448315 


PPP2R5C 


T- 

CL 
CO 
*3 


HNRPR 


EPRS 


RDC1 


RPS20 


CAPZB 


RPL35A 


ARPP-19 




Up 


Up 


Up 


|Up 


|Up 


Up 


Up 


Up 


Up 


Cl 


1.73082466 


1.73397436 


1.77202113 


1.77823154 


1.81671606 


1.82967985 


1.86331201 


1.87890685 


1.88697215 


1.8942492 | 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


1.437502002 


1.150213675 


1.303463119 


1.701002735 


12.220391227 


1.983126177 


1.260724947 


1.313625866 


1.249595687 


1.955271565 | 


Down 


Down 


Down 


ON 


UP 


O 
2 


Down 


Down 


Down 


O 
2 


0.830530114 


8 

CD 
CO 
CO 
CO 

s 

d 


0.735579896 


CM 

CO 
CO 

a> . 

CO 
LO 
CO 

s 

d 


1.22220047 


rM 

r— 

in 

CO 
OO 

ro 


LO 
»— 
CO 

^. 
o 

CD 
CO 

CD 

a 


DO 
X> 

CO 

ro 
* 

35 
35 
° 
3 


LO 

CM 
CM 
CO 

to 

CD 


o> 

ro 

LO 

<T 

CM 
CM 

3 

t— 



WO 2004/111197 



37/81 



PCT/US2004/018492 



T cell receptor 
gamma-chain 
mRNA, C2 
reaion 




hypothetical 
protein FU23514 


hypothetical 
protein FLJ22489 


LIM domain only 

4 * ! 


tumor necrosis 
factor (ligand) 
superfamily, 
member 10 


ARP3 actin- 
related protein 3 


KIAA0582 protein 


If 

C 

c 

CN 
LL 


zinc finger 
protein 148 (pHZ- 


non-functional 
folate binding 
protein 




CO 
t— 








CO 


CM 


CM 


O 


CO 


o 


7p15-p14 


CO 
CO 

T— 

a 

CO 


11q14.1 


cr 
i 

CM 

cr 

N» 


CO 

cm 

CM 

Q. 
V" 


3q26 


CO 

cr 

CM 


2p14 




T— 

CM 

or 

CO 




M27331 


NM 005830 


OO 
OJ 

°l 

Z 


AK026142 


BC003600 


NMJ303810 


Z78330 


AB011154 


AK022067 1 


s 

CD 

§ , 

2 
Z 


s 

CO 

5 I 

Z 


TRG@ 


IMRPS31 ! 


FU23514 


ODAG 


LM04 


TNFSF10 


ACTR3 


RAB1A 

i 




ZNF148 




Up 


CL 

3 


Up 


Up 


Q. 
3 


Up 


Up 


Up 


Up ! 


Up 


Up 


1.9122807 


11.93178937 | 


1.96875323 


2.03108982 


2.04251084 


2.07985401 


2.08886347 


2.09323599 


2.10989918 


2.15639007 


2.20779768 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


UP ! 


|UP 


UP 


1.721052632 


CO 
CO 

a? 

s 

oo 


3.036109674 


1.932646707 


3.167032406 


1.526395866 


00 
CO 

3 


2.41906807 


11.51512374 


5 

in 

CM 
O 
OO 

in 

CM 


1.568640674 


ON 


uown 


CL 
3 


z 


a. 

3 < 


Down 


uown 


CL 
3 


Down j 


Z 


Uown 


< 

< 

< 

CD < 

d ( 


30 
S> 

CD 
N 
X) 

5 


CO 

«- 
<r 

ao 

r- 

rsi 

s 

r- 


ED 
N. 
90 
r- 

ro 
si 
i- 

tO 
3) 

d 


□> < 

rM l 
<r < 

30 1 

n < 

lO 

q 

r* 1 


SO < 
< 

CD 

n < 

J> < 
30 


N 

■o 
<r 

■o 

r- 
^« 

•*» 1 

n 
=> 


CO 1 
CD < 
Si 

3> 1 

m ( 

S3 

n i 

Si 

r- | 

c-* < 


< 

?! 

™M < 

r- 

^. 


"0 

3 < 
N 

35 < 
X> < 
c— 

q i 

r" < 


» 
5 

r- 
s« 

D 



WO 2004/111197 PCT/US2004/018492 

38/81 



GAP-associated 


c 

'5 *~ 

11 

W CO 
3 O C\ 

t- JZ cc 
2? Q. c 


methyl-CpG 
binding domain 


hepatic leukemia 
factor (HLF) 


hypothetical 
protein FLJ20063 


epidermal growth 
factor receptor 
(erythroblastic 
leukemia viral (v- 
erb-b) oncogene 


> JS 

5 r- (I'D r- t/J 7 
- a fl> <f CO « 

» c c 

! 5 -1 8 J o I 

: c <> w o = 

> >» W u O) c 

: » cm co cl to c 


CD2 antigen 
(p50), sheep red 
blood cell 
receptor (CD2) 




CO 




CO 




CD 

T— 




CO 
CL 


cr 

CM 
cr 

CO 


17q22 


3q13.3 


7p12 


CO 
CM 
Cr 

CM 
CM 
cr 
CD 
t— 


1p13 


BC000717 


AL556619 


M95585 


NM_017648 


AW1 57070 


O 
10 

r* 

s 
°l 

z 


(2 

O 

9 

z 


KHDRBS1 


Q 

GO 


HLF 


MUC13 


EGFR 


SNTB2 


CD2 


CL 


Up 


Up 


Up 


Up 


a 
D 


Up | 


2.22845501 


2.28992904 


2.32384937 


2.41239892 


2.47103115 


2.47647832 


2.56006265 


UP 


UP 


CL, 
D 


ON 


UP 


UP 


UP 


1.557351443 


1.541355881 


CM 
r— 

O) 
OO 

ro 
ro 

r- 

cm 

XJ 


5 

o 
o 

CD 
^ 

ED 
D> 
£D 


CO 

s 

ro 
<fr 

LO 
r* 

CO 


1.844415243 


CM 

s 

□> 

X) 
CM 
3 
J) 
3> 


Down 


uown 


!L 

3 i 


uown 


}- 

3 t 


[uown 


uown 


0.698848053 


3 5 

!— ■« 

C 
- 

o c 

3 C 


o ! 
«- | 
30 

M I 
* ( 

2 ( 
M ( 
M < 


3 ! 

s : 

JO I 
3> C 
3) 1 
*> C 
=> t 


I ! 

M c 

o r 

O f 
0 * 

N r 
c 


N 
5 
f 
3 


O 

O 
* 
««. 

v. 

3 



WO 2004/111197 



39/81 



PCTYUS2004/018492 



FU12802 fis, 
clone 

NT2RP2002124, 
weakly similar to 
UBIQUITIN 
CARBOXYL- 
TERMINAL 
HYDROLASE 4 


hypothetical 
protein FU11271 


dihydropyrimidina 
se-like 2 
(DPYSL2) 


signal transducer 
and activator of 
transcription 1, 
91kDa ! 


signal transducer 
and activator of 
transcription 1, 
91 kD (STAT1) 


KIAA0737 gene 
product 


KIAA1096 protein 


T-cell receptor 
rearranged beta- 
chain V-region 
(V-D-J) 






CO 


CM 


CM 








1p31.1 


CO 

s 

7 

csi 

CM 
V 


T— 

CM 
Q. 

CM 
CM 
CL 
CO 


CM 
CM* 
CO 
O" 
CM 


CM 

CM 
CO 




CO 
CO 
CM 
CT 
r— 


14q32.33 


AK022864 


CO 

CO 
CO 

1 

Z 


CD 
CO 
CO 

T— 

O 
°l 
Z 


M97935 


in 

T— 

CO 
O 

s 

Z 


BE783632 


AL096857 


M15564 | 


VDU1 


SYNJ2BP 


DPYSL2 


STAT1 


STAT1 


KIAA0737 


KIAA1096 


IGHM 


Up 


Up 


Up 


Up 


Up 


Up 


Up 


Up 


2.60522193 


2.61085884 


2.73148309 


2.79879603 


2.89818105 


2.95221519 


3.27600275 


3.31842024 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


UP 


2.04998135 ! 


1.961105627 


4.103729402 


1.726232295 


1.757263068 


2.657594937 


2.119459088 ( 

i 


2.297068806 


Down 


Down 


UP 


Down 


Down 


ON 


Down 


Down 


0.78687398 


0.751134301 


1.502381406 


0.61677674 


0.60633309 


0.900203666 


0.64696499 


0.692217573 



WO 2004/111197 



40/81 



PCTYUS2004/018492 



peptidyl-prolyl 
isomerase G 
fcyclophilin G) 


deleted in 
malignant brain 
tumors 1 
(DMBT1) 


kinectin 1 (kinesin 
receptor) 


tumor necrosis 
factor (ligand) 
superfamily, 
member 10 


c 
'E 

CO 

c 

2 _ 

i§ 


CM 


o 




CO 


CO 


r- 

T— 

CO 


CD 
CM 

V 

CO 

in 

CM 

or 
o 


5 

cr 

T— 


3q26 


CO 
CM 

cr 
i 

T— 

CM 

cr 

CO 


I 

2 

Z 


8 
3 

9 

z 


BF589024 


AW474434 


CO 

i, 

Z 


PPIG 


DMBT1 


KTN1 


TNFSF10 


LTF 


Up 


Up 


Up 


Up 


Up 


3.35522953 


3.61129645 


3.88999458 


3.94019139 


34.438061 


UP 


UP 


UP 


UP 


UP 


2.717463322 


2.143352601 


2.384544469 


2.030847171 


14.65960503 


Down 


Down 


Down 


Down 


Down 


0.809918754 


0.593513336 


0.612994291 


0.515418408 


LO 
CM 

ro 
o 

CO 
CO 

in 

CM 



WO 2004/111197 



PCT/US2004/018492 



41/81 





Description |" 


lysosomal glucocerebrosidase 


renal tumor antigen (RAGE) I 


FK506 binding protein precursor (LOC51303) 


mitotic spindle coiled-coil related protein 
(DEEPEST) 


highly similar to AF1 13251 Homo sapiens 
putative 2-hydroxyacid dehydrogenase 


weakly similar to UBIQUITIN CARBOXYL- 
TERMINAL HYDROLASE 4 


signal transducer and activator of transcription 1 , 
91kDa 


jLIS1 -interacting protein NUDEL; 
endooiigopeptidase A (NUDEL) 


| hypothetical protein MGC35048 ! 


GAP-associated tyrosine phosphoprotein p62 
(Sam68) 


deubiquitinating enzyme UnpES (UNP) mRNA, 
complete cds. ubiquitin specific protease 4 
(proto-oncogene) 


|cDNA FLJ12005 fis | 


Iclaudin 10 (CLDN10) | 


(Kallmann syndrome 1 sequence (KAL1) I 


aldehyde dehydrogenase 2, mitochondrial 
(ALDH2) 


| hypothetical protein FLJ 12567 | 


|GS3955 protein (GS3955) | 


|ARP3 actin-related protein 3 homolog (yeast) ! 


I RAD23 homolog B (S. cerevisiae) 


Imethyl-CpG binding domain protein 4 | 


epidermal growth factor receptor (erythroblastic 
leukemia viral (v-erb-b) oncogene homolog, 
avian) 




Q. 
CD 
? 


1q21 . | 


14q32 | 


12p11.1 


17q11.1 


CM 

cr 

O) 


1p31.1 


2q32.2 


17p12 


CO 
CO 

T— 

CL 

CD 


1p32 


3p21.3 




CO 

7 

CO 

cr 

CO 


|Xp22.32 


12q24.2 




|2p25.1 


I2q13 


|9q31.2 


cr 

T— 

CN 
CT 
CO 


7p12 




Genbank | 


K02920 j 


NM_014226 | 


NM_016594 


T- 

co 

CO 

o 

1 


AK026752 


AK022864 


M97935 


CO 

o 

CO 

o 

CO 

°1 


IAW070229 | 


BC000717 


AF017306 


|AK022067 ! 


^ 

00 
G> 
CD 
O 
O 

Z 


CD- 

T~ 

CN 
O 
O 

J 


069000 IAIN 


CN 
°l 

Z 


|NMJ>21643 


IZ78330 


[AL527365 


AL556619 


AW1 57070 




Common ] 


GBA | 


RAGE | 


FKBP11 


SPAG5 


GRHPR; GLXR 


VDU1 


STAT1 


NUDEL 


|MGC35048 ! 


KHDRBS1 


USP4 




|CLDN10 1 


|KAL1 


ALDH2 




IGS3955 


IACTR3 


|RAD23B 


o 

CO 


EGFR 


|53003_analysis | 


(Systematic | 


|209093_s_at | 


|2051 30_at | 


3 

CO 

T— 
T— 

O) 
CN 


+-» 

lO 
CO 

o 

CM 


*l 

CO 

o 

CO 
CO 

T— 

CM 


CO 
CO 
CO 
CO 

CM 


CO 

o> 

Oi 

i 

CO 
LL 

CD 
. CO -ft 

5 CO 

< X in' 


CO. 
0) 
CO 

o 

CO 

o 

CM 


"co 

CM 

a> 

CO 
CO 

CM 


CO 
X 
00 

co 
o 

CM 


211800 s at 


|214912_at 


|205328_at 


CD 

o 

CN 
LO 

o 

CN 


"5. 

LO 
CM 

o 

CM 


^— < 
CO 

^3- 
LO 
T — 

0> 

T— 

CN 


-4— ' 

CO 
CO 

h- 
o 

CM 


I213102_at 


CO 
CO 
CN 

o 

CN 


|209579_s_at 


.*-» 

CO 

CO 

CO 
CO 

o> 
o 

CM 



WO 2004/111197 



PCT/US2004/018492 



42/81 



J? 



31 
2 

Q. 

O 

o 
3 
c: 
o 

-O 

■cz 

i_ 
CO 
CD 

o 

3 



CO 

o 

CD _ 

^ Q 



O 

T— 

CD 

o co 

£ -6 

_ 3> 



<D CD 
^ 05 

ID 3* 



CL 

■vz 
u 



LL! 

o 
1= 
*o 
cd 
a. 
co 

I 

E 

CD 



Q. 

CD 

s 

CD l_ 

to £ 

P Q 



o 


CO 


T— 


C 


< 


T3 


CD 


C 




la 


b 


a> 


o 








o~ 




*o 

CD 


tior 
103 


a. 


O " 


03 


c ° 


3 LL 


CO 


*V < 


L— ' 
CO 
CD 

•4— < 


£ w 



J3 

3 

CO 

J5 CM 

3 a. 
a) a. 
£ cl 

cm" £ 

CD o 

B 8 
co — 

to E 
° E 
a. £ 



o ^ 



X 



5 



a> .2 
a. P 

1 o 

CO 

4= 5 
"p Q_ 

.E a) 
to o 

CD 

a> 0 



co ^ 

j| 
-11 



I? 

CO 

^ CD 

n 

2 o 

s °- 

C0.2> 
C *~" 

c 



c 

'•S 

CO 

73 O 

C O 

_ CO t- 



3 

o 
CD 

0) 

c 
E 

CO 
CO 

is 

8§ 

Z 0) 

<n 

■a E = 
62 



is 

a. a 

|1 

■5 2 
a. 

CD 



i_ c 

E *E 

CO CO 

CO CO 

8 8 

3 J3 

a> a> 

o o 

co to 



LO 
CM 

cr 

cr 
in 



CM 

CO 
CT 



CO 

co 
cr 



CO 



CM 
CL 
CO 



CM 

CT 
CM 



CM 

cr 



CL 



CO 



CO 






CD 


CM 


CD 




CO 


CD 


lO 




V 


O 


CM 


CM 


o 


o 












2 


2 



co 
oo 
co 

i 

< 



CD 



N 



O 
m 
DC 

CM 
CL 
CO- 
OL 



CO 
CD 
CO 

T— 

o 

CM 



CO 

*l 
I 

CO 
CM 



oo 
o 

CM 



CM 



3 
s 

CD 
O 
CM 



CM 



CO, 



o 

CM {CM 



WO 2004/111197 



PCT/US2004/018492 



43/81 



E 

cd 

CL 



•a 
c 
co 

D) 



O CO 

a 

'co 

So 

O -c- 

8 - 

i_ -Q 
E CD 

2 E 



CD 
CM 
CT 
CO 



CM 
CM 

CM 
cr 



CO 

LI- 



CO 
CO 
CO 
CM 
O 
CM 



CO 
=3 

o 
cu 
c 
cu 

CJ) 

e 

a) 
"S 

E 

QQ o 

Q. 
O 
Q) 

il 

o: o 

C -Q 



CM 

cr 



cr 



o 
& 
< 

CM 

a 
o 



«'3" 

<2 Q 

I o 

C £ 
CD 

■D C 

C — 

CD - 
Q- CD 

CD v~ 

•a cl 



CM 
CL 



CM 
CD 

CD '~T 
CD ~ 

2 & 

« % 

o JS 

3 



co 

CD 
O 



Si 



TO, 



CM 



CO 



CD 
CL 
CO 



c 

CD 
S 

CO 

_d aj 

J= CO 



CM 
cr 
X 



o 

CD 
CO 

B 

V) 
CO 



0 

CO 

3 CO 

co q) 

S E 

S & 

CD S 

to « 



CM 
CO 

CM 

oi 
X 



E ^ 
a 

13 o 



i 

Q 

CM 
CD 

"T 
CD 
CO 
CO 

c 



0> 



3 



CL 

a 

o 

CD 



LL 

c 

5 

2 

CL 

TO 
O 

a> 
sz 
*-> 
o 

CL 



c 
o 



a 

CO 

c 

2 



2 

CO 

> 
o 

CO 
T3 

c 

CO 



o 

CM 

! 



CL 
CD 
O 

E 

0) 
CL 

O 
O 



CD 

E 
o 

CL 
CO 

I 
i 

ID 

ja> 
a 
>* 
o 

c 
o 
"co 
■> 

1 



ID 
05 
CO 
ID 
Q_ 

a. 

c 

*CD 
•*— « 

2 

CL 

o 



o 

& 



CO 
CM 

cr 



ID 
CM 

en 
o 



CD 
O 
^* 

o 
o. 



ID 

CM 

7 
CM 
ID 
CM 

cr 



CM 



cr 



CM 

co 
cr 

CM 



CM 



CM 

CL 
CD 



CM 
CO 

CM 

cr 
o 



m 



CD 
CO 
O 

2 



ID 
—I 

a 
a 

LU 

cr 



CO 



CM 

co 

CO 
CO 
CO 

r^- 
LD 
DO 



ID 
CD 
CO 
h- 
ID 



< < 



CM 
CO 

h- 

CM 

. o. 



CO 



co 

i 



LL 



I 



O 

CM 

o 

2 



o 
a 



o 

CM 
CO 
CL 



CO 

O 
Q 
O 



0. 

CD 

CM 

>- 
CO 



CO 

CO 

o 

ID 
CM 
CO 

8 



CO 

X 

CO 
CO 
CD 

O 
CM 



to 

CO 

co 
o 
o 

CM 



(0 
CO. 



CM 



CO 
CM 
r— 
CM 



CD i 

U) 

CD 
LO 
CO 
CD 
CD 
CM 



CD 

a> 

CO 
ID 

a. 

CL 



CO, 



CO 
CD 
CO 

CM 



WO 2004/111197 



PCT/US2004/018492 



44/81 



IKIAA0582 orotein 1 


tissue inhibitor of metalloproteinase 1 
(erythroid potentiating activity, collagenase 
inhibitor) (TIMP1) 


putative tumor suppressor O.UCA15t I 


DKFZP434J21 4 protein 


dystrobrevin B DTN-B2 1 


surfactant, pulmonary-associated protein C 
(SFTPC) 


I glutamyl-prolvl-tRNA synthetase 


hypothetical protein FLJ21511 (FL J2151H 


alanyl (membrane) aminooeDtidase 


hairyenhancer-of-split related with YRPW motif 
1 (HEY1) 


hypothetical protein FLJ22489 I 


RAN binding protein 17 (RANBP17) i 


© 

w c 

5 a 
S Z 

CO 1- 
CD c 

II 

Q- =2 
w c 

0 9 

CL C 
CO 

If 

0 c 
(1) 

1 s 
s s 

S § 

m .c 


valosin-containinq protein 5 


T cell receptor gamma-chain mRNA, C2 
region 

similar to rat myomegalin 


r\iM/Auuoz protein 

tripartite motif-containing 3 

H-RYK receptor tyrosine kinase ' 


CD2 antigen (p50), sheep red blood cell 
receptor (CD2) 


homogentisate 1,2-dioxygenase (homogentisate 
oxidase) (HGD) 


nooKinase (KbSKj 

testis specific leucine rich repeat protein 
(TSLRP) 


hypothetical protein FLJ21062 (FLJ21062} 


hypothetical protein FLJ 12975 


tumor necrosis factor (ligand) superfamily, 
member 10 


J2p14 


ro 
CN 

T— 
T— 

CL 
i 

CO 

T— 
T— 

CL 

X 


|3p21.3 


CD 
CN 

cr 

CO 


I2p23-p22 


8p21 


CN 
cr 


CM 

T— 

CL 
*t 


CD 
CM 

D" 

1 

lO 
CM 
cr 
to 


8q21 


7q21-q22 


r 

CO c 
cr h 
10 t- 


1p36.11 


CM 
v- 

CL 

i 

CO 

CL 
CD 


If) CM t 

Q. cr c 
t- if 


11p15.5 
3q22 


1p13 


CO 
CM 
CTc 

T- O 

CM C 
cr C 

co r 


CM 

1 CY 

"> NT 
n| CM 
2. CT 
vj CO 


7p1l2 


CO 
CM 

cr 

CM 
r™ 


CD 
CM 
cr 
co 


AB011154 


NM 003254 


U23946 


AL556438 


IY15718 


00 

o 

CO 

o 
o 

2 


AI475965 


r^. 

OO 

o 

LO 

s 

2 


09 WOO" IAIN 


NM 012258 


AK026142 


£ 0 

CO rr 

CM r- 
°, c 

2 c 

2 DC 


BC001449 


W60953 I 


M27331 
AB042557 


AA1 14843 
X96588 


CO 
h- 

O 

5 

2 


r- a 
00 0 

T— V 
O CN 
O C> 
O C 

2 2 


NM 012472 


NMJJ24788 


NM_024809 ] 


AW474434 


RAB1A 


TIMP1 


£ t 


CN 
—> 

s 

sr 
X 
M 

£ 


2 


SFTPC 


i 


r— 


CL 
JJ 
X 
2 
< 


HEY1 


a 
3! 

D I 


CNP 


HNRPR 


a. 

> 


TRG@ 

PDE4DIP 

PPAP2A 


TRIM3 
RYK 


CD2 


HGD 
RBSK 


TSLRP 


FLJ21062 


FLJ12975 | 


TNFSF10 


"l 




CO 


V 

"l 


i— > - 
CO 


i—i 

5 




CO 

w " 


B 


CO 

tn 




t-< - 

5 


COW 


•4-1 


5 


W CO CO 


at 

s at 




co. ro 


"co. ' 


CO 


4— » 

I 


"l 

X 


212677 




201666 


NT I 

7) C 
O C 
- C 
3 i 
N C 


O t 
D C 
O C 
N L 

N C 


7) 
N 
O 

N 


205982 


N • 

^ c 
o r 
D c 
D c 
N C 


* c 
N C 
^ c 
D C 
N C 
N C 


x> 
0 
0 

N 
D 
N 


218839 


0 -< 
- c 
c 

d- < 

- 

M C 


208912 


208766 


D 
O 

M 


216920 
210305 
212896 


213885 
216976 


205831 


205221 
219222 


206483 


-0 ( 

-O ( 

- 

ZT> ( 
c- C 
"N C 


0 

D 
D 

N 


214329 



WO 2004/111197 



PCT/US2004/018492 



45/81 



putative gene with similarity to zinc finger proteins 


phosphatidylinositol glycan, class Q (PIGQ) [ 


rhodanese, thiosulfate sulfurtransferase | 


mitochondrion 


peptidyl-prolyl isomerase G (cyclophilln G) I 


RAB5C, member RAS oncogene family 
(RAB5C) 


HMG-box transcription factor TCF-3 


glycoprotein M6B 


DKFZp761H087 p53 inducible protein j 


putative transmembrane protein (HS1-2) I 


uteroglobin (UGB) I 


ribosomal protein L35a I 


neurogenic extracellular slit protein 


period homolog 1 (Drosophila) 


thyroid hormone receptor interactor 13 
(TRIP13) 


hepatic leukemia factor (HLF) 


nuclear transcription factor, X-box binding 1 
(NFX1) 


T-cell receptor rearranged beta-chain V-region 
(V-D-J) 


histone deacetylase 2 (HDAC2) I 


myosin XVB, pseudogene 


DKFZP586B2420 protein 


KIAA0740 protein | 


ubiquitin specific protease 14 (tRNA-guanine 
transglycosylase) 


RNA binding motif protein 8A 


protein kinase C, delta binding protein 


MK-STYX (LOC51657) ! 


lactotransferrin (LTF) | 


cyclic AMP phosphoprotein | 


glutamate decarboxylase 1 (brain, 67kD) 
(GAD1) 


21q11.1 


16p13.3 || 


22q13.1 | 


CO 
CO 

T- 

cr 

CO 

T— 


1 — 
CO 
CT 
CM 


CM 
CM 

- cr 


CM 
r- 
r— 
CL 

CM 


CM 

CM 
CM 
CL 

X 


5q34 | 


15q26.1 ! 


11q12.3-q13.1 | 


«_ 

0) 

CT 
i 

CD 
CM 
cr 
CO 


CM 

LO 

T— 

CL 


CM 
CL 

r- 
i 

T— 

CO 

T— 

CL 


5p15.2 


17q22 | 


9q34.11 


CO 
CO 

CM 
CO 

cr 


6q21 . 1 


17q25.3 


2q37.3 


CM 
CM 
cr 
O 


18p11.32 


1q12 


LO 

T— 

CL 

T— 
T~ 


7p12.3 


CO 
CM 

7 

CM 
cr 

CO 


|15q15.3 


2q31 


AL163202 


NMJ)04204 ] 


D87292 j 


o 

CO 
CO 
LO 

o 

i 

z 


CM 
CD 

r- 
o 
°1 

Z 


CO 
CO 
LO 

• 

o 
o 

z 


CO 
CO 
CM 

T— 

CO 

Z 


AF016004 ] 


AL161999 j 


o 

o 
o 

z 


LO 
CO 
CO 
O 

3 

Z 


AW517686 I 


AF055585 | 


AF022991 | 


CO 
CM 

o 
o 

Z 


M95585 I 


NM_002504 


M 15564 


r- 
CM 

LO 

t— 
O 

z 


AI825877 


AL050143 | 


AB018283 


BC003556 


AI738479 


AI088622 


CD 
CO 

o 

CD 

1 

Z 


NMJ302343 


BC003418 


h- 

CO 

o 
o 

i 


PRED65 


PIGQ 1 


TST I 


MRPS31 | 


PPIG ) 


RAB5C 


TCF-3 | 


GPM6B | 


CYFIP2 | 


ABHD2 | 


SCGB1A1 j 


RPL35A j 


SLIT2 | 


PER1 j 


TRIP13 


HLF | 


ODF2 


IGHM 


HDAC2 j 


MY015B ! 


DKFZP586B2420 | 


RHOBTB1 | 


USP14 


RBM8A 


PRKCDBP 


LOC51657 


LTF 


ARPP-19 


GAD1 


216469 at 


CO 
if) 

o 

CM 


•*—> 

LO 
O 
CD 
CD 

o 

CM 


:< 

o 

CD 
CM 

CM 


CO 

5 

CO 
CD 
CD 
CO 

o 

CM 


201140 s at 


.♦— » 

CO 

v> 

CD 

o 

x— 
CM 
CM 


CD 
CD 

T— 

CD 

8 


CO 

CO 

LO 
CO 
h- 
LO 

CM 


"co 

CD 
CD 
LO 
LO 

8 


|205725_at ( 


<*— > 

CO 

CD 
CO 

CM 

T— 

CM 


to 
tn 
r- 

CD 
CO 
CD 

o 

CM 


|36829_at | 


"3 

CO 
CO 

o 

§ 

CM 


CO 

X 

LO 
LO 

s 

CM 


«*-» 

M i 

LO 
CO 
LO 
CM 
O 
CM 


to 

X 

LO 
CD 

o 

T— 

CM 


|201833_at I 


59375_at j 


5 

CO 
CO 
CM 


|212651_at 


CO 

*. 

CO 

o 

CM 


-»— « 

CD 
t/) 
CO 

CM 


213010_at 


*! 

CM 
CO 
CO 

T— 

CM 


120201 8_s_at 


CO 

V) 

CM 
CO 

T— 

CM 
CM 


205278 at 



WO 2004/111197 



PCT/US2004/018492 



46/81 































•o 


c 










0) 


B 










ociat 












o 








CD 






o 




(0 


CO 










CO 








g 


CO 


B-UI 


E 


















Q 






CD 






FZ 


en 


tro 


lam' 










) to 




on 


Q 




dv 






"O 

2 
a 

0 
c 


protein 


dated ai 


eta 2 ( 


in (actir 


^~ 

>% 
c 
o 


a) 




_Q 


CP 




CT 
CO 
00 


Jtical 


asso 


C 

In 


prot 


main 


CD 




tz 


§-j= 


CO 

c 


doi 


i 


Q 






&iS 








CD 


£ 2 








ri 




CO fl) 








CO 


CO CL 


o -Q 










CO 






CM 






CM 








CM 


CO 


9" 




CO 


CO 


c\i 


CO 


CM 


CO 




CO 




CM 


CM 


Q. 


cr 


Q. 


cr 


CO 




CD 


«^ 


r- 


CO 


a 




T— 


T 


T— 




T— 






in 


o 


o 






14583 


T— 


0) 


lO 




03600 


CM 


CO 


N- 




O 
CM 
O 


o 
o 


CO 
. O 
O 


T— 

CM 


BO 








CO 

o 


o 
O 


< 


z: 






ZD 


CD 




























5 


































CO 
CD 


T — 

CD 




CM 


GO 




I 


N 




CO 


N 






h 


I 


H 


DL 


5 


< 






2: 












CO 




























s 






> — < 








■4—1 








-4-J 


CO 




5. 


5 


w i 












lO 


r> 


O ( 


M I 


n 








CO < 


O C 


ID 


CO 


2,! 


M 


5!! 


O C 


Z) 


in 


O ( 


J) 






o 


( 




CO C 


m|cm| 


CM 


CO ( 


M 



WO 2004/111197 



PCTYUS2004/018492 



47/81 



OMIM 






142385 




601880 


134934 


156354 




UNIGENE 


CM 

8 

CO 
CO 

X 


Hs.2258 


O) 
CO 

o 
in 

CM 

CO 

X 


Hs.8765 


o 

1 

CO 

X 


Hs.1420 


Hs.2667 


Hs.278462 


LOCUS 
LINK 


11170 


4319 


3131 


11325 


6376 


2261 


4496 


4501 


MAP 


§. 

CO 


11q22;3 


17q22 


CO 
CM 

cr 


CO 

T- 

cr 
co 


CO 
CD 


16q13 


16q13 


Functional category 


NULL 


collagen catabolism // 
inferred from electronic 
annotation 


regulation of 
transcription, DNA- 
dependent // inferred 
from electronic 
annotation 


cytoplasm // non- 
traceable author 
statement 


immune response // not 
recorded 


skeletal development // 
predicted/computed 


metal ion binding // 
inferred from electronic 
annotation 


heavy metal 
sensitivity/resistance // 
traceable author - 
statement 


GENBANK 
DESCRIPTION 


TU3A protein 


matrix 

metalloproteinase 10 
(stromelysin 2) 


hepatic leukemia 
factor 


RNA helicase-related 
protein 


small inducible 
cytokine subfamily D - 
(Cys-X3-Cys), 
member 1 
(fractalkine, 
neurotactin) 


fibroblast growth 
factor receptor 3 
(achondroplasia, 
thanatophoric 
dwarfism) 


metailothionein 1H 


metailothionein 1X 


Fold Change 


0.25762095 


0.39706706 


0.4701625 


0.52723124 


n CICCQDflE 


0.54726067 


0.55093068 


0.55320327 


Up or 
Down 
regulation 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Smoker_NS 
moker_p 
, Value 


§ 
o 
o 
o 
o 
o 
d 


0.00000004 


o 


0.00000781 


0.00000006 


0.00003694 


0.00002215 


0.00000031 


GB_ACC 


AL050264. 
1 


NM 00242 
5.1 


M95585.1 


AF078844. 


1^ 

Z CD 


r— 
O 
O 

Z CM 


in 

CD 

m 
o 

Z T 


m 

CD 

m 
o 
o 

2 - 

Z CM 


2 

UJ 

z 

LU 

O 




MMP10 


HLF 




SCYD1 


FGFR3 


MT1H 


MT1X 


Gene list 
BU.AFFXJD 


CO 
O 

8 


5 

CO 
CO 

in 

s 


in 
m 

1 


«1 

X 

a 

m 
o 

CM 




5 

Oi 
CO 

1 


i 

CD 
CO 

8 


"l 
*l 

T" 

CO 

in 

§ 

CN 



WO 2004/111197 



PCT/US2004/018492 



48/81 



lAIIIAIO 


601880 


142385 




OUT boo 


308700 


603746 


603349 




UNIGENE 


l 

i 

Hs.80420 


Hs.250692 




oo 

CO 

to 
X 


Hs.89591 


Hs.29802 


CO 
CO 

00 

05 

X 


CM 

in 

^3" 
LO 
N- 

to 
X 


LOCUS 
LINK 


6376 


3131 




3248 


3730 


9353 


2034 




MAP 


|16q13 


CM 

! 




to 

CO 

i 

CT 


CO 

1 


CM 

to 


CO 

9- 

CM 




Functional category 


immune response // not 
recorded 


regulation of 
transcription, DNA- 
dependent // inferred 
from electronic 
annotation 


cytoplasm If non- 
traceable author 
statement 


metabolism // Inferred 
from electronic 
annotation 


cell motility // traceable 
author statement 


mesoderm migration // 
inferred from mutant 
phenotype 


NULL 


spermatid development 
// traceable author 
statement 


GEN BANK 
DESCRIPTION 


small inducible 
cytokine subfamily D 
(Cys-X3-Cys), 
member 1 
(fractalkine, 
neurotactin) 


hepatic leukemia 
factor 


RNA hellcase-related 
protein 


hydroxyprostaglandin 
dehydrogenase 15- 
(NAD) 


Kallmann syndrome 1 
sequence 


slit homolog 2 
(Drosophila) 




heat shock 70kD 
protein 2 


Fold Change 




0.56297333 


0.56721444 


0.56944096 


0.58642444 


0.58826137 


0.59071696 


CO 
CM 

CO 

o> 
in 
d 


Up or 
Down 
regulation 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Down 


Smoker_NS 
moker _p 
Value 


o 


0.00000017 


0.00000001 


0.00003925 


0.00003575 


0.00001259 


0.00000045 


0.00000016 


O 

S 
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o 

5 

H CO 
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AF052094. 


U56725.1 


1, 

UJ 

UJ 
O 


SCYD1 


HLF 




HPGD 


KAL1 


SLIT2 


EPAS1 
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2 

I 
O 


156353 


300222 


i 


156352 


156358 


602570 




603742 






UNIGENE 


Hs.334409 


Hs.17109 


Hs.258503 


Hs.203936 


Hs.94360 


Hs.166154 


CO 

5 

03 

X 


CD 
CD 

CO 

to 

r- 

co 
X 




CO 
N. 
CD 
CD 

to 
X 


LOCUS 
LINK 


in 
o> 

5 


9452 




4494 


4500 


3714 


60494 


6585 






MAP 


16q13 


CO CM 

CO T— 

CM 
or cr 

! X X 




CO 

s 


CO 
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CO 

cr 


cr 


CO 
CO 
CM 

cr <<t 

O CM 

t- cr 






Functional category 


metal ion binding // 
inferred from electronic 
annotation 


integral to membrane // 
inferred from electronic 
annotation 


NULL 


cytoplasm // non- 
traceable author 
statement 


cytoplasm // non- 
traceable author 
statement 


N receptor signaling 
pathway // non- 
traceable author 
statement 


NULL 


NULL 


cytokinesisV/inferred 
from electronic 
annotation 


NULL 


GENBANK_ 
DESCRIPTION 


metaliothionein 1G 




cytoplasmic FMR1 
interacting protein 2 




serine threonine 
kinase 39 
(STE20/SPS1 
homolog, yeast) 


jagged 2 


hypothetical protein 
FU23514 


slit homolog 1 
(Drosophiia) 


cyclin D2 


Homo sapiens cDNA 
FU32766 fis, clone 
TESTI2001862 


Fold Change 


0.60076773 


0.60146828 


0.60939296 


0.61734582 


0.61891557 


0.63262088 


0.66833583 


0.67196823 


0.6749563 


0.68035154 


Up or 
Down 
regulation 


Down 


Down 


Down 


Down 


Down 


Down 


o 
Q 


Down 


Down 


Down 


to 

CD lJ 

E o o 
co E > 


0.00000147 


0.00005283 


0.00000053 


0.00000031 


rsi 
o 
o 
o 
o 
o 
o 
o 
d 


0.00002692 


0.00000047 


0.00000089 


0.00000909 


0.00006342 


O 

l 
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o 
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m 
I s * 

O 
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OMIM 


600848 


186845 






CN 

s 




1 

cn 
a 
a 

CO 




UNIGENE 


Hs.287994 


I 

£ 


Hs.173717 


Hs.6721 


Hs.74304 


Hs.75428 


!Hs.78596 




LOCUS 
LINK 


9612 






11343 


! 

i 

5493 


6647 




5693 




MAD 


s 

c 

CN 

T— 


11p15 




3q21.3 


CO 
CO 

s 


T- 

CN 


CN 

o 




Functional category 


DNA binding activity// 
extended: Unknown; 
myb DNA-binding; 3.9e- 
10 


protein complex 
assembly // not 
recorded 


NULL 


inflammatory response 
// traceable author 
statement 


cytoskeleton // 
traceable author 
statement 


response to oxidative 
stress // not recorded 


NULL 


cyclic nucleotide 
metabolism // 
experimental evidence 


GEN BANK 
DESCRIPTION 


nuclear receptor co- 
repressor 2 


CD81 antigen (target 
of antiproliferative 
antibody 1) 


phosphatidic acid 
phosphatase type 2B 


monoglyceride lipase 


periplakln 


superoxide dismutase 
1, soluble 

(amyotrophic lateral 
sclerosis 1 (adult)) 


proteasome 
(prosome, 
macropain) subunit, 
beta type, 5 




Fold Change 


0.74126481 


0.74141757 


0.7547925 


0.75734151 


0.79282943 


1 9^no7e;qft 


1.23472792 


1.24764073 


Up or 
Down 

regulation . 


Down 


Down 


Down 


Down 


Down 


Up 


a. 


Up 


Smoker_NS 

moker_p 

Value 


0.00003465 


0.00000198 


0.00006297 


0.00000672 


0.0000534 


0.00002194 


0.00003406 


0.00003238 
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Hs.75722 


G) 
GO 
CD 
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X 


Hs.75S1fl 


cc 
cc 
cr 
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OC 
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*- 
CN 
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u 
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00 
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CO 
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CN 
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LOCUS 
LINK 
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8766 i 
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I 
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10549 
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11057 
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Fvj/Ar 


^ **: 

cr co 
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CM cr 
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CM CM 
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CM m . 
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CO 
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■ m 
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cr M" 
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O 
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Functional category 


protein modification // 
traceable author 
statement 


ATP binding activity // 
extended: Unknown; 
HATPase c;5.4e-13 


intracellular protein 
transport // inferred from 
electronic annotation 


central nervous system 
development// 
predicted/computed 


cell motility // not 
recorded 


l-kappaB 

phosphorylation // 
traceable author 
statement 


Intracellular protein 
transport// inferred from 
electronic annotation 


NULL 


integral to membrane // 
non-traceable author 
statement 


GENBANK_ 
DESCRIPTION 


ribophorin II 


tumor rejection 
antigen (gp96) 1 


RAB11A, member 
RAS oncogene family 


6- 

pyruvoyltetrahydropte 
rin synthase 


trophoblast 
glycoprotein 


peroxiredoxin 4 


sortilin-related 
receptor, L(DLR 
class) A repeats- 
containing 


NAD kinase 


lung alpha/beta 
hydrolase 2 


Fold Change 


1.26443482 


1.28607041 


1.28734777 


1.29325473 


1.30815003 


1.31811994 


1.34981345 


1.35785099 


1.36323619 


Up or 
Down 
regulation 


a. 


CL 


CL 


Q. 


CL 
ZD 


CL 


CL 
Z> 


a 
z> 


Q. 


CO 

co J 
^ cd a> 

O ^ Z3 

E o ro 
co E > 


0.00002416 


0.0000016 j 


0.0000151 


0.00003398 


0.00002117 


0.00001455 


0.00002403 


0.00001625 


0.00004071 
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OMIM 


158343 


123841 








103880 


603603 








UNIGENE 


Hs.89433 


Hs.699 


Hs.10267 


Hs.220324 


Hs.17144 


Hs.75313 


Hs.144442 


Hs.21542 


Hs.38972 


CO 
CO 

CO 

X 


LOCUS 
LINK 


4363 


5479 


23786 




9249 


CO 
CM 


8399 


23287 


10103 


11163 


MAP 


CO 
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Functional category 


small molecule 
transport// traceable 
author statement 


protein folding // 
inferred from electronic 
annotation 


caspase activation // 
non-traceable author 
statement 


NULL 


metabolism // inferred 
from electronic 
annotation 


I carbohydrate 
metabolism // traceable 
author statement 


lipid catabolism // 
inferred from electronic 
annotation 


NULL 


cell proliferation // not 
recorded 


cyclic nucleotide 
metabolism // 
experimental evidence 


GENBANK 
DESCRIPTION 


ATP-binding 
cassette, sub-family 
C (CFTR/MRP), 
member 1 


pepUdylprolyl 
isomerase B 
(cyclophilin B) 


MIL1 protein 


NAD kinase 


short-chain 
dehydrogenase/reduc 
tase 1 


aldo-keto reductase 
family 1 , member B1 
(aldose reductase) . 


phosphoiipase A2, 
group X 


ATP/GTP binding 
protein 1 


tetraspan 1 


nudlx (nucleoside 
diphosphate linked 
moiety X)-type motif 4 


Fold Change 


1.36862413 


1.37577594 


1.37660816 


1.37784082 


1.38649421 


1.4007013 


1.42588829 


1.43452772 


1.4367707 


1.43822734 


Up or 
Down 
regulation 


Up 


Up 


Up 


Up 


Up 


Up 


Up 


Up 


Up 


Up 


Smoker_NS 

moker_p 

Value 


0.00005085 


0.00004739 


0.00005905 


0.00000238 


0.00000325 


0.0000218 


0.00002852 


0.00001725 


0.00000494 
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Functional category 


regulation of 
transcription, DNA- 
dependent // inferred 
from electronic 
annotation 1 * 


signal transduction // 
traceable author 
statement 


carbohydrate 
metabolism // traceable 
author statement 


threonyl-tRNA 
aminoacylalion // 
inferred from electronic 
annotation 


metabolism // traceable 
author statement 


induction of apoptosis // 
experimental evidence 


regulation of 
transcription, DNA- 
dependent // inferred 
from electronic 
annotation 


GENBANK_ 
DESCRIPTION 


hypothetical protein 
FU23407 


inositol(myo)-1 (or 4)- 
monophosphatase 2 


GDP-mannose 4,6- 
dehydratase 


threonyl-tRNA 
synthetase 


UDP 

glycosyltransferase 1 
family, polypeptide A9 


HIV-1 Tat interacUve 
protein 2, 30 kD 


v-maf . 

musculoaponeurotic 
fibrosarcoma 
oncogene homolog G 
(avian) 


Fold Change 


1.50397769 


1.50470629 


1.5059961 


1.5344928 


1.54255992 


1.54504237 


1 54913013 


Up or 
Down 
regulation 


Up 


Up 


Up 


Up 


Up 


Up 


Up 
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z 
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co E > 


0.00002754 


0.00002945 
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0.00002058 


0.0000014 
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0.00001062 
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Functional cateaorv 


signal transduction // 
not recorded 


metabolism // traceable 
author statement 


cell proliferation // 
traceable author 
statement 


endocytosis // inferred 
from electronic 
annotation 


carbohydrate 
metabolism // traceable 
author statement 


regulation of cell growth 
II inferred from 
electronic annotation 


NULL 


GENBANK_ 
DESCRIPTION 


S100 calcium binding 
protein A10(annexin 
II ligand, calpactin I, 
light polypeptide 
(P1D) 


UDP 

glycosyltransferase 1 
family, polypeptide A3 


peroxiredoxin 1 


adaptor-related 
protein complex 1, 
gamma 1 subunit 


UDP-N-acetyl-alpha- 
D- 

galactosaminerpolype 
ptide N- 

acetylgaiactosaminylt 
ransferase 7 (GalNAc 
T7) 


connective tissue 
growth factor 


ferritin, heavy 
polypeptide 1 


Fold Change 


1.59972573 


1.61820133 


1.62489119 


1.62599641 


1.63625819 


1.65588494 


1.6618808 | 


Up or 
Down 
regulation 


Up 


Up 


Up 


Up 


Up 
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Up 


co 

CD J 
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E £ ca 
to E > 
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0.00003102 


0.00000042 
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0.00000011 


0.00005544 
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OMIM 


602365 


606358 


191740 


601168 




603263 


230450 


277730 


602oo4 


UNIGENE 


Hs.10029 


Hs.91011 


Hs.278898 


Hs.74566 


Hs.109706 


Hs.5338 
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X 


Hs.105435 
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Functional category 


proteolysis and 
peptidolysis // not 
recorded 


oncogenesis // 
traceable author 
statement 


transferase activity, 
transferring hexosyl 
groups // 

extended:Unknown; 
UDPGT; 0 


signal transduction // 
traceable author 
statement 


NULL 


one-carbon compound 
metabolism // inferred 
from electronic 
annotation 


cysteine metabolism // 
traceable author 
statement 


transferase activity // 
inferred from electronic 
annotation 


carbohydrate 
metabolism // traceable 
author statement 
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Functional category 


cAMP-dependent 
protein kinase, regulator 
activity // 

extended:inferred from 
eiectronic annotation; 
Rlla; 4.5&-15 


pathogenesis // 
traceable author 
statement 


electron transport // 
inferred from electronic 
annotation I 


pentose-phosphate 
shunt, oxidative branch 
// traceable author 
statement" 


cytoplasm // not 
recorded 


cysteine metabolism // 
traceable author 
statement 


NULL 


signal transduction // 
traceable author 
statement 


physiological processes 
// non-traceable author 
statement 
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calcium-binding 
tyrosine-(Y)- 
phosphorylation 
regulated 
(fibrousheathin 2) 


transmembrane 4 
superfamily member 
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thioredoxin reductase 
1 


, , , 

phosphogluconate 
dehydrogenase 


annexin A3 


glutamate-cysteine 
ligase, modifier 
subunit 
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Functional category 


regulation of 
transcription, DNA- 
dependent// traceable 
author statement 


metabolism // inferred 
from electronic 
annotation 


tricarboxylic acid cycle 
// not recorded 


integral to plasma 
membrane // inferred 
from direct assay 


transcription from Pol II 
promoter //traceable 
author statement 


xenobiotic metabolism 
// traceable author 
state mpnt 


NULL 


electron transport // 
inferred from electronic 
annotation 
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alcohol 

dehydrogenase 7 
(class IV), mu or 
sigma polypeptide 


carbonyl reductase 1 


malic enzyme 1, 

NADP(+)-dependent, 

cytosolic 


transmembrane 4 
superfamily member 
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Pirin 


aldo-keto reductase 
family 1 , member C1 
(dihydrodioi 
dehydrogenase 1 ; 20- 
alpha (3-alpha)- 
hydroxysteroid 
dehydrogenase) 


ESTs 
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Functional category 


lipid metabolism // 
traceable author 
statement 


xenobiotic metabolism 
// traceable author 
statement 


protein binding // 
inferred from physical 
i interaction 


| : 

response to oxidative 
stress // inferred from 
electronic annotation 


cell adhesion // 
experimental evidence 


electron transport // 
inferred from electronic 
annotation 


NULL I 


protein binding // 
inferred from physical 
interaction 


electron transport // 
inferred from electronic 
annotation 


GENBANK 
DESCRIPTION 


aldo-keto reductase 
family 1, member C3 
(3-alpha 
hydroxysteroid 
dehydrogenase, type 




mucin 5, subtype B, 
tracheobronchial 


glutathione 
peroxidase 2 
(gastrointestinal) 


claudin 10 


NAD(P)H 
dehydrogenase, 
quinone 1 




mucin 5, subtypes A 
and C, 

tracheobronchial/gast 

iric 


NAD(P)H 
dehydrogenase, 
quinone 1 


Fold Change 


3.14061275 


3.37268899 


3.38189534 


3.42949443 


3.4329732 


3.46737099 


3.55079646 | 


3.62489718 


4.0084017 
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MAP 
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Functional category 


electron transport // 
inferred from electronic 
annotation 


electron transport // 
inferred from electronic 
annotation 


electron transport// 
inferred from electronic 
annotation 
















GEN BAN K_ 
DESCRIPTION 


cytochrome P450, 
subfamily I (dioxin- 
inducible), 
polypeptide 1 
(glaucoma 3, primary 
infantile) 


cytochrome P450, 
subfamily I (dioxin- 
inducible), 
polypeptide 1 
(glaucoma 3, primary 
infantile) 


cytochrome P450, 
subfamily I (dioxin- 
inducible), 
polypeptide 1 
(glaucoma 3, primary 
infantile) 
















Fold Change 
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creatine kinase, brain 


phosphogluconate dehydrogenase /// phosphogluconate 
dehydrogenase 


integrin-linked kinase 


thioredoxin reductase 1 


eukaryotic translation initiation factor 4E 


NAD(P)H dehydrogenase, quinone 1 


NAD(P)H dehydrogenase, quinone 1 


nuclear factor of kappa light polypeptide gene enhancer in B-cells 
inhibitor, alpha 


carcinoembryonic antigen-related cell adhesion molecule 5 


lactotransferrin 


growth arrest-specific 6 


SWI/SNF related, matrix associated, actin dependent regulator of 
chromatin, subfamily a, member 5 


a disintegrin and metalloproteinase domain 9 (meltrin qamma) 


"' 

cytochrome P450, family 1, subfamily B, polypeptide 1 


cytochrome P450, family 1, subfamily B, polypeptide 1 
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